Conversation
paulo101977
left a comment
There was a problem hiding this comment.
Ops, I forgot of send a message here.
|
Hello, |
|
Some report about the performance: I could get good results on Breakout but couldn't get it to work on Pong for now. Runs can be found on W&B: https://wandb.ai/openrlbenchmark/sbx |
Dont worry!!!! I appreciate you looking at my code. |
|
After sharing the CNN features extractor between the actor and the critic, the learning curves match the ones from SB3 =) I'll do a bit more runs and then merge. |
Description
closes #80
Performance report: https://wandb.ai/openrlbenchmark/sbx/reports/PPO-CNN-Performance-report--VmlldzoxNDU2OTk2OA
Motivation and Context
Types of changes
Checklist:
make format(required)make check-codestyleandmake lint(required)make pytestandmake typeboth pass. (required)make doc(required)Note: You can run most of the checks using
make commit-checks.Note: we are using a maximum length of 127 characters per line