The Soft Actor-Critic (SAC) algorithm is implemented to train an agent for the provided hockey environment. This project was done as part of my Reinforcement Learning course at the University of Tübingen.
Since the hockey environment was completely new and somewhat more challenging, an easier environment was chosen at the beginning.
First, the agent was trained on the pendulum environment. It is considered a relatively easy environment, which helped me verify that my approach and implementation were sound (see the sketch below). For more information about this game, have a look at the official gym documentation.
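As a point of reference, here is a minimal sketch of interacting with the pendulum environment. It assumes the Gymnasium API and the `Pendulum-v1` environment id; the random-action loop is only a placeholder for the trained SAC agent.

```python
# Minimal sketch: run the pendulum environment with random actions
# (assumes the Gymnasium API; replace the random action with the SAC policy).
import gymnasium as gym

env = gym.make("Pendulum-v1")
obs, info = env.reset(seed=0)
for _ in range(200):
    action = env.action_space.sample()          # placeholder for the trained agent
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()
env.close()
```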
After establishing a good foundation and successfully handling the pendulum environment, I moved on to the hockey game provided by our professor. The hockey environment is a game between two players, in which we control the left player.
Soft Actor-Critic (SAC) is an off-policy algorithm. Unlike other off-policy algorithms (e.g. TD3), its exploration comes from its emphasis on maximizing entropy, which represents the uncertainty in the agent's actions. By maximizing entropy, SAC encourages exploration and prevents the agent from prematurely converging to potentially suboptimal policies. For further information about SAC, my approach, and my results, please take a closer look at my paper.
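To make the role of the entropy term concrete, the following is a minimal sketch of the entropy-regularized actor update used in SAC. It assumes PyTorch, a tanh-squashed Gaussian policy, two critics, and a fixed temperature `alpha`; the names, sizes, and the smoke test are illustrative and not taken from the project code.

```python
# Minimal sketch of the SAC actor update (illustrative, not the project's exact code).
import torch
import torch.nn as nn

class GaussianPolicy(nn.Module):
    """Tanh-squashed Gaussian policy with reparameterized sampling."""
    def __init__(self, obs_dim, act_dim, hidden=256):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.mu = nn.Linear(hidden, act_dim)
        self.log_std = nn.Linear(hidden, act_dim)

    def sample(self, obs):
        h = self.body(obs)
        mu, log_std = self.mu(h), self.log_std(h).clamp(-20, 2)
        dist = torch.distributions.Normal(mu, log_std.exp())
        u = dist.rsample()                      # reparameterization trick
        a = torch.tanh(u)                       # squash actions to [-1, 1]
        # log-probability with the change-of-variables correction for tanh
        log_prob = (dist.log_prob(u) - torch.log(1 - a.pow(2) + 1e-6)).sum(-1, keepdim=True)
        return a, log_prob

class QNetwork(nn.Module):
    """Simple state-action value network."""
    def __init__(self, obs_dim, act_dim, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim + act_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, obs, act):
        return self.net(torch.cat([obs, act], dim=-1))

def actor_loss(policy, q1, q2, obs, alpha=0.2):
    """SAC actor objective: maximize min(Q1, Q2) plus an entropy bonus (-alpha * log_prob)."""
    a, log_prob = policy.sample(obs)
    q = torch.min(q1(obs, a), q2(obs, a))       # clipped double-Q estimate
    return (alpha * log_prob - q).mean()        # minimizing this ascends Q + entropy

# Tiny smoke test with random data
obs_dim, act_dim = 3, 1
policy, q1, q2 = GaussianPolicy(obs_dim, act_dim), QNetwork(obs_dim, act_dim), QNetwork(obs_dim, act_dim)
loss = actor_loss(policy, q1, q2, torch.randn(32, obs_dim))
loss.backward()
```

In the full algorithm, the temperature `alpha` can also be learned automatically by tuning it toward a target entropy instead of keeping it fixed.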