It currently trains well on Catch, still running experiments on Atari. I am yet to find a good set of hyperparams for Atari games, will post updates about this.
You can find the repo here, let me know if you have any suggestions / comments.
He was teaching RL in my undergrad school in France. This course changed my life.
The paper is amazing, now I’m looking forward seeing how it reacts with some parallelism boost.