Skip to content
View kadamrahul18's full-sized avatar

Highlights

  • Pro

Organizations

@hackforla

Block or report kadamrahul18

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
kadamrahul18/README.md

Hi, I'm Rahul Kadam 👋

MS Computer Engineering @ NYU. I work on ML systems: distributed training benchmarks, evaluation tooling, and reproducible experimentation.

🔭 Engineering Focus

  • Distributed Training & Benchmarking: fixed-work experiments, throughput/step-time measurement, multi-GPU scaling (PyTorch + DeepSpeed/ZeRO)
  • Systems Foundations: C++ projects (cycle-accurate simulators), Linux tooling, test harnesses
  • Open Source: Contributed to Opik (Comet ML) Python SDK — merged PR with unit tests + docs

🚀 Highlighted Projects

Project Tech Stack Evidence / Impact
Distributed LLM Training Benchmarks PyTorch, DeepSpeed, Slurm Fixed-work multi-GPU benchmarking for GPT-2 (124M); tokens/sec scaling and run artifacts for reproducibility.
MIPS Processor Simulator C++, Linux, Make Cycle-accurate 5-stage pipeline simulator with hazard detection + forwarding; verified via regression tests / traces.
Opik LLM Eval Platform (Merged PR #1006) Python, Pytest, Docs Added SentenceBLEU/CorpusBLEU metrics (NLTK-backed) + unit tests + docs; exported via opik.evaluation.metrics.
Brain Tumor Segmentation Baseline (MONAI 3D U-Net) PyTorch, MONAI, Linux, Slurm Reproducible training/eval pipeline on MSD Task01 with guardrails (ROI/label checks, metric conventions) and saved artifacts for reruns/plots.

🛠️ Tech Stack

  • Systems: Python, C++, Linux, Bash
  • ML Systems: PyTorch, DeepSpeed (ZeRO), testing (pytest), experiment reproducibility (configs/artifacts)
  • Tooling: Git, Docker

Pinned Loading

  1. GPT2-Optimization GPT2-Optimization Public

    GPT-2 (124M) fixed-work multi-GPU training benchmark on Slurm (V100) using DeepSpeed ZeRO-1 + AMP. Measured 1→4 GPU scaling (3.42× throughput) with reproducible run artifacts (configs + metrics JSO…

    Python

  2. comet-ml/opik comet-ml/opik Public

    Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

    Python 17.5k 1.3k

  3. Classification-of-MRI-images-for-Brain-Tumor-Using-Convolutional-Neural-Networks Classification-of-MRI-images-for-Brain-Tumor-Using-Convolutional-Neural-Networks Public

    Reproducible 3D brain tumor segmentation baseline using MONAI 3D U-Net (MSD Task01), with config-driven train/eval and GPU-ready execution on Slurm (NYU Big Purple). Includes correctness guardrails…

    Python

  4. CSA-Labs/mips-pipelined CSA-Labs/mips-pipelined Public

    Cycle-accurate 5-stage MIPS pipeline simulator in C++ (IF/ID/EX/MEM/WB) with hazard detection + forwarding. Includes per-cycle state tracking and regression tests/traces for correctness.

    C++ 1