Proximal Policy Optimization

mhubii · March 18, 2019, 4:08pm

Hey,

I just wanted to share this implementation of Proximal Policy Optimization for the C++ API of Pytorch with you. Feedback is much appreciated. I struggle on letting the algorithm converge for harder problems than this, shown on GitHub.

smth · March 20, 2019, 3:27pm

Thanks a lot for sharing Martin!

adik993 · March 24, 2019, 6:47pm

PPO is tricky one to fine tune. For me it was a lot of trial and error to get it work on my own implementation. You may take a look at the hyperparameters I used for some OpenAI gym problems, maybe it will work for you too. You may also want to try normalize the state if it is complex on the harder problems. I hope it helps