This project focuses on the analysis of weight matrices in the pythia-410M Large Language Model (LLM) and its training checkpoints.
We investigate the Key, Query, Value, and Output projection matrices of the attention mechanism, as well as the token embedding matrix and the MLP weight matrices, for each layer and for every training checkpoint.
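A minimal sketch of how these matrices could be collected per checkpoint, assuming the Hugging Face `transformers` library and the public `EleutherAI/pythia-410m` checkpoints (published under revisions such as `step1000`). The module names used here (`query_key_value`, `dense`, `dense_h_to_4h`, `dense_4h_to_h`, `embed_in`) follow the GPT-NeoX architecture that Pythia is based on; note that the Query, Key, and Value projections are stored as a single fused matrix in this architecture.

```python
import torch
from transformers import GPTNeoXForCausalLM

def load_checkpoint_matrices(step: int) -> dict[str, torch.Tensor]:
    """Return the weight matrices of interest for one training checkpoint."""
    model = GPTNeoXForCausalLM.from_pretrained(
        "EleutherAI/pythia-410m", revision=f"step{step}"
    )
    matrices = {"embed_in": model.gpt_neox.embed_in.weight.detach()}
    for i, layer in enumerate(model.gpt_neox.layers):
        # Fused Q/K/V projection; the attention output projection is `dense`.
        matrices[f"layer{i}.attn.query_key_value"] = layer.attention.query_key_value.weight.detach()
        matrices[f"layer{i}.attn.dense"] = layer.attention.dense.weight.detach()
        # MLP up- and down-projection matrices.
        matrices[f"layer{i}.mlp.dense_h_to_4h"] = layer.mlp.dense_h_to_4h.weight.detach()
        matrices[f"layer{i}.mlp.dense_4h_to_h"] = layer.mlp.dense_4h_to_h.weight.detach()
    return matrices

# Example: weights after 1,000 optimizer steps.
# weights_step1000 = load_checkpoint_matrices(1000)
```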
Our analysis explores how the characteristics of these matrices evolve during training through four key metrics (see the sketch after this list):
- examination of their spectral properties
- assessment of matrix orthogonality
- evaluation of sparsity across all matrices
- analysis of how the weights evolve under different matrix norms
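A minimal sketch of these four per-matrix metrics, assuming each weight matrix is available as a 2-D `torch` tensor (e.g. from `load_checkpoint_matrices` above). The sparsity tolerance and the specific norm choices are illustrative, not the project's fixed definitions.

```python
import torch

def matrix_metrics(W: torch.Tensor, sparsity_tol: float = 1e-3) -> dict[str, float]:
    W = W.float()
    singular_values = torch.linalg.svdvals(W)

    # Spectral properties: largest singular value and stable rank.
    spectral_norm = singular_values[0].item()
    stable_rank = (singular_values.pow(2).sum() / singular_values[0] ** 2).item()

    # Orthogonality: deviation of W^T W from the identity
    # (zero for a column-orthonormal matrix).
    gram = W.T @ W
    orth_deviation = torch.linalg.norm(gram - torch.eye(gram.shape[0])).item()

    # Sparsity: fraction of entries with magnitude below a small tolerance.
    sparsity = (W.abs() < sparsity_tol).float().mean().item()

    # Overall weight magnitude under different matrix norms.
    frobenius_norm = torch.linalg.norm(W).item()
    nuclear_norm = singular_values.sum().item()

    return {
        "spectral_norm": spectral_norm,
        "stable_rank": stable_rank,
        "orthogonality_deviation": orth_deviation,
        "sparsity": sparsity,
        "frobenius_norm": frobenius_norm,
        "nuclear_norm": nuclear_norm,
    }
```

Tracking these quantities for every matrix at every checkpoint yields per-layer trajectories that can be compared across the metrics listed above.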
By tracking these characteristics across all layers and checkpoints, we aim to deepen our understanding of LLM training dynamics and contribute to discussions on model interpretability and optimization. This work builds on existing literature emphasizing the importance of weight distribution properties during training, and provides empirical insights that can inform future research on model architecture and training methodology.