Add mixtral support #12

Vahe1994 · 2024-02-08T12:01:30Z

Added Mixtral support

justheuristic · 2024-02-20T13:42:19Z

main.py

            for sublayer_name in aq_handlers.keys():
                print(f"Quantizing module {sublayer_name} of layer {layer_index}")
+                if "mixtral" in model.config.model_type.lower() and args.mix_compression:
+                    print("mix_compression")


Suggested change

print("mix_compression")

justheuristic · 2024-02-20T13:45:38Z

main.py

            for sublayer_name in aq_handlers.keys():
                print(f"Quantizing module {sublayer_name} of layer {layer_index}")
+                if "mixtral" in model.config.model_type.lower() and args.mix_compression:
+                    print("mix_compression")
+                    assert args.nbits_per_codebook == 16


Nitpicking: this looks like a very specific configuration for a general option (--mix_compression)

if you have time, can you please

assert that model type is mixtral

double the number of codebooks for self-attn layers

use standard number of codebooks for non self-attn layers

If you don't have time, please

merge this PR

create an issue, saying

PR https://github.com/Vahe1994/AQLM/pull/12/ adds a very specific configuration for--mix_compression. It would be great to generalize it to something more generic. For instance, we can - assert that model type is mixtral - double the number of codebooks for self-attn layers - use standard number of codebooks for non self-attn layers

will do, thanks!

justheuristic

LGTM (with optional requsts in the comments)

Vahe1994 added 3 commits February 8, 2024 15:55

Add mixtral support

b0454f9

Black and isort

e46ebbf

Renaming mixtrl_layer_ident to layer_ident

5275769

justheuristic reviewed Feb 20, 2024

View reviewed changes

justheuristic approved these changes Feb 20, 2024

View reviewed changes

Vahe1994 added 3 commits February 21, 2024 21:44

Added asserts and made mix_compression for 2x on attn and x on experts

fc227f7

Added asserts and made mix_compression for 2x on attn and x on experts

c221017

isort and black

dd4545f

Vahe1994 merged commit 8f75dcd into main Feb 21, 2024
2 checks passed

Vahe1994 deleted the mixtral_support branch February 21, 2024 17:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add mixtral support #12

Add mixtral support #12

Vahe1994 commented Feb 8, 2024

justheuristic Feb 20, 2024

justheuristic Feb 20, 2024

Vahe1994 Feb 20, 2024

justheuristic left a comment

Add mixtral support #12

Add mixtral support #12

Conversation

Vahe1994 commented Feb 8, 2024

justheuristic Feb 20, 2024

Choose a reason for hiding this comment

justheuristic Feb 20, 2024

Choose a reason for hiding this comment

Vahe1994 Feb 20, 2024

Choose a reason for hiding this comment

justheuristic left a comment

Choose a reason for hiding this comment