davidkimai

Follow

💭

optimizing my reward function

davidkimai

💭

optimizing my reward function

Follow

Stay hungry. Stay foolish. - Steve Jobs. 24. Interpreting AI Psychology. Hacking Context and Mapping Minds @open-cognition

497 followers · 257 following

Achievements

Achievements

Pinned Loading

dash dash Public

Dash - Agent Orchestration Platform

TypeScript 1
misalignment-monitoring misalignment-monitoring Public

SPAR AI. A minimal viable demo showcasing monitoring architecture for detecting deceptive AI behavior in the wild.

Python
sabotage sabotage Public

Heron AI 90min Work Test | Project - Michael Chen METR Sabotage Threat Modeling. A small eval harness designed to measure Monitor Negligence. The script simulates a "Monitor" (the insider) reviewin…

Python 2
ralph-zero ralph-zero Public

Ralph Zero - Your agents can now orchestrate Ralph using Skills! Ralph Zero is an orchestrator system wrapped in an Agent Skills package over Geoffrey Huntley's Ralph Loop that implements complex m…

Python 7 2
Context-Engineering Context-Engineering Public

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 8.3k 940
RL101 RL101 Public

Agentic Reinforcement Learning 101. A pragmatic course for AI/ML Engineers based on "The Landscape of Agentic Reinforcement Learning for LLMs: A Survey" https://arxiv.org/abs/2509.02547

Roff 16 2