
Conversation

@CryVeck CryVeck commented Dec 18, 2024

The main modifications to support Llama 3.1 and 3.2:

  • With Llama 3.2, tie_word_embeddings=True, so the rotation only needs to be applied once, to the input embeddings, because the output (lm_head) embeddings share the same weights (see the sketch after this list).
  • With Llama 3.2, config.num_key_value_heads differs from config.num_attention_heads (grouped-query attention), so the full formula config.hidden_size * config.num_key_value_heads / config.num_attention_heads is needed to get the right k/v projection dimension.
  • Added one Hadamard matrix.
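
Below is a minimal sketch of the three changes, assuming a QuaRot-style rotation pass over a Hugging Face Llama model; the helper names (rotate_embeddings, kv_proj_out_features, sylvester_hadamard) and the exact call sites are illustrative assumptions, not the PR's actual code.

```python
import torch


def rotate_embeddings(model, Q: torch.Tensor, config) -> None:
    """Rotate the token embeddings by the orthogonal matrix Q (sketch)."""
    dtype = model.model.embed_tokens.weight.dtype
    W = model.model.embed_tokens.weight
    W.data = (W.data.to(torch.float64) @ Q).to(dtype)
    # Llama 3.2 ties input and output embeddings (tie_word_embeddings=True):
    # lm_head shares this tensor, so rotating it here again would apply Q twice.
    if not config.tie_word_embeddings:
        head = model.lm_head.weight
        head.data = (head.data.to(torch.float64) @ Q).to(dtype)


def kv_proj_out_features(config) -> int:
    # With grouped-query attention (num_key_value_heads != num_attention_heads,
    # as in Llama 3.2), the k/v projection output dimension is not hidden_size
    # but hidden_size * num_key_value_heads / num_attention_heads.
    return (config.hidden_size * config.num_key_value_heads
            // config.num_attention_heads)


def sylvester_hadamard(n: int) -> torch.Tensor:
    """Orthonormal Hadamard matrix of size n (Sylvester construction, n a power of two)."""
    assert n > 0 and n & (n - 1) == 0, "n must be a power of two"
    H = torch.ones(1, 1, dtype=torch.float64)
    while H.shape[0] < n:
        H = torch.cat([torch.cat([H, H], dim=1),
                       torch.cat([H, -H], dim=1)], dim=0)
    return H / torch.sqrt(torch.tensor(n, dtype=torch.float64))
```

For example, for Llama 3.2 1B (hidden_size 2048, 32 attention heads, 8 KV heads), kv_proj_out_features gives 2048 * 8 / 32 = 512, whereas using hidden_size directly would give the wrong k/v projection size.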


yc2367 commented Jul 9, 2025

This is very helpful for my current work comparing QuaRot on Llama-3.2. Thank you very much! I would appreciate it if the author could review and merge this if applicable. @sashkboos
