Skip to content
View abdussahid26's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report abdussahid26

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
abdussahid26/README.md

Hi there 👋

Github states Most used languages

Pinned Loading

  1. Paligemma-Multimodal-Vision-Language-Model-from-Scratch Paligemma-Multimodal-Vision-Language-Model-from-Scratch Public

    This repository provides an implementation of PaliGemma, a state-of-the-art multimodal vision-language model (VLM) released by Google DeepMind.

    Python 1 1

  2. GPT-2-Model-from-Scratch-to-Generate-Text GPT-2-Model-from-Scratch-to-Generate-Text Public

    Implementation of a GPT-2 model from scratch for text generation. This repository also includes instructions on scaling the training of the 'GPT-2 model from scratch for text generation' across a c…

    Jupyter Notebook 1

  3. RAG-from-Scratch-with-PyTorch RAG-from-Scratch-with-PyTorch Public

    Here's a complete guide on building a Retrieval-Augmented Generation (RAG) pipeline from scratch and running it entirely locally. This approach ensures privacy, avoids API costs, and allows full co…

    Jupyter Notebook

  4. LLM-Post-training-Techniques LLM-Post-training-Techniques Public

    This repository contains implementations of LLM post-training techniques, including SFT, PEFT, RLHF, PPO, DPO, and more.

    Jupyter Notebook

  5. Transformer-Architecture-from-Scratch-with-PyTorch Transformer-Architecture-from-Scratch-with-PyTorch Public

    This is an implementation of the Transformer architecture from scratch, based on "Attention Is All You Need."

    Python

  6. DeepSeek-R1-Fine-tuning-for-Code-Generation DeepSeek-R1-Fine-tuning-for-Code-Generation Public

    General fine-tuning of the DeepSeek-R1-Distill-Qwen-1.5B model for code generation tasks.

    Jupyter Notebook 3