Pinned Loading
-
Reducing-Hallucinations-with-Direct-Preference-Optimization
Reducing-Hallucinations-with-Direct-Preference-Optimization PublicAn RLHF-inspired DPO framework that explicitly teaches LLMs when to refuse, significantly reducing hallucinations.
-
Decision-Transformer-from-Scratch-HalfCheetah-Minari-BC-vs-Return-Conditioned-DT
Decision-Transformer-from-Scratch-HalfCheetah-Minari-BC-vs-Return-Conditioned-DT PublicImplementing Decision Transformers from scratch for offline RL, benchmarking return-conditioned policies against Behavior Cloning.
Python
-
VulneraAI-agent
VulneraAI-agent PublicAn agentic LLM security scanner that analyzes applications against OWASP Top 10 using tool-calling, LangGraph, and AWS Bedrock.
Python
-
-
Multi-agent-RL-texas-holdem-aec
Multi-agent-RL-texas-holdem-aec PublicAn engineering-focused multi-agent reinforcement learning system for Texas Hold’em using PettingZoo AEC and a custom PyTorch PPO self-play setup.
Python
-
Alzheimer-Disease-Stage-Classification-CNNs-vs-Transformers-
Alzheimer-Disease-Stage-Classification-CNNs-vs-Transformers- PublicA comparative study of CNNs vs Vision Transformers for Alzheimer’s disease stage classification on brain MRI, with detailed error and performance analysis
Jupyter Notebook
If the problem persists, check the GitHub status page or contact support.