Authors: Gokul Puthumanaillam, Manav Vora

This code compares different algorithms for solving the lunar lander problem. The algorithms simulated are:
- Monte Carlo
- Q-Learning
- DQN
- SARSA
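
As a rough illustration of how two of the listed methods differ, the sketch below shows the textbook tabular update rules for Q-Learning and SARSA. It is not taken from this repository: the function names, the `ALPHA` and `GAMMA` values, and the dictionary-based Q-table are all assumptions made for the example, and the state keys are assumed to be hashable (for instance, a discretized observation).

```python
from collections import defaultdict

ALPHA = 0.1    # learning rate (assumed value, not from this repository)
GAMMA = 0.99   # discount factor (assumed value, not from this repository)
N_ACTIONS = 4  # the discrete lunar lander has four actions


def make_q_table():
    """Hypothetical Q-table: maps a hashable state to a list of action values."""
    return defaultdict(lambda: [0.0] * N_ACTIONS)


def q_learning_update(q, state, action, reward, next_state, done):
    """Off-policy update: bootstrap from the best action in next_state."""
    target = reward if done else reward + GAMMA * max(q[next_state])
    q[state][action] += ALPHA * (target - q[state][action])


def sarsa_update(q, state, action, reward, next_state, next_action, done):
    """On-policy update: bootstrap from the action actually taken next."""
    target = reward if done else reward + GAMMA * q[next_state][next_action]
    q[state][action] += ALPHA * (target - q[state][action])
```

The only difference between the two functions is the bootstrap term: Q-Learning uses the maximum value over next actions, while SARSA uses the value of the next action the policy actually selected.
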

| Simulation | Final Trajectory |
|---|---|
| Random policy | ![]() |
| Monte Carlo | ![]() |
| Q-Learning | ![]() |
| SARSA | ![]() |
| DQN | ![]() |









