I am an Applied Scientist focused on building Small Language Models from scratch (pre-training, fine-tuning, distillation, dataset curation, etc.). I also enjoy designing efficient software architectures and data pipelines.
Core Stack: Python, PyTorch, Transformers, PEFT, Boosting Models, AWS, GCP, dbt, SQL.

