Md. Abdus Sahid abdussahid26

🎯

Focusing

My research interests lie in Visual-language models (VLMs), large language models (LLMs), multimodal AI, and federated learning.

5 followers · 2 following

abdussahid26/README.md

Hi there 👋

Pinned Loading

Paligemma-Multimodal-Vision-Language-Model-from-Scratch Paligemma-Multimodal-Vision-Language-Model-from-Scratch Public

This repository provides an implementation of PaliGemma, a state-of-the-art multimodal vision-language model (VLM) released by Google DeepMind.

Python 1 1
GPT-2-Model-from-Scratch-to-Generate-Text GPT-2-Model-from-Scratch-to-Generate-Text Public

Implementation of a GPT-2 model from scratch for text generation. This repository also includes instructions on scaling the training of the 'GPT-2 model from scratch for text generation' across a c…

Jupyter Notebook 1
RAG-from-Scratch-with-PyTorch RAG-from-Scratch-with-PyTorch Public

Here's a complete guide on building a Retrieval-Augmented Generation (RAG) pipeline from scratch and running it entirely locally. This approach ensures privacy, avoids API costs, and allows full co…

Jupyter Notebook
LLM-Post-training-Techniques LLM-Post-training-Techniques Public

This repository contains implementations of LLM post-training techniques, including SFT, PEFT, RLHF, PPO, DPO, and more.

Jupyter Notebook
Transformer-Architecture-from-Scratch-with-PyTorch Transformer-Architecture-from-Scratch-with-PyTorch Public

This is an implementation of the Transformer architecture from scratch, based on "Attention Is All You Need."

Python
DeepSeek-R1-Fine-tuning-for-Code-Generation DeepSeek-R1-Fine-tuning-for-Code-Generation Public

General fine-tuning of the DeepSeek-R1-Distill-Qwen-1.5B model for code generation tasks.

Jupyter Notebook 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Md. Abdus Sahid abdussahid26

Block or report abdussahid26

Hi there 👋

Pinned Loading

Uh oh!