Example code for recurrent policy gradient?

Xingdong_Zuo · December 13, 2017, 1:32pm

Is there any example code for recurrent policy gradients? Is it simply a matter of replacing the MLP with an RNN?

dgriff · December 24, 2017, 12:18am

I’ve got examples of recurrent policy gradients in a newly made repo for A3C with continuous action spaces:

You can also see the older repo for discrete action spaces on Atari:
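
On the original question of whether it is just a matter of swapping the MLP for an RNN: mostly yes, but the hidden state also has to be carried from step to step and reset at the start of each episode. Below is a minimal sketch of that idea, assuming PyTorch and plain REINFORCE rather than A3C. It is not taken from the repos mentioned above; `RecurrentPolicy`, `episode_loss`, and all the sizes are hypothetical names and values chosen for illustration.

```python
import torch
import torch.nn as nn
from torch.distributions import Categorical


class RecurrentPolicy(nn.Module):
    """Discrete-action policy with an LSTM cell in place of an MLP body (illustrative)."""

    def __init__(self, obs_dim, hidden_dim, n_actions):
        super().__init__()
        self.hidden_dim = hidden_dim
        self.lstm = nn.LSTMCell(obs_dim, hidden_dim)
        self.head = nn.Linear(hidden_dim, n_actions)

    def initial_state(self, batch_size=1):
        # Zero hidden and cell state, used at the start of every episode.
        zeros = torch.zeros(batch_size, self.hidden_dim)
        return zeros, zeros.clone()

    def forward(self, obs, state):
        # One environment step: update the memory, then produce action logits.
        h, c = self.lstm(obs, state)
        return Categorical(logits=self.head(h)), (h, c)


def episode_loss(policy, observations, actions, rewards, gamma=0.99):
    """REINFORCE loss for one recorded episode (rewards is a list of floats)."""
    state = policy.initial_state()  # reset memory at the episode boundary
    log_probs = []
    for obs, action in zip(observations, actions):
        # The state returned at step t is fed back in at step t + 1.
        dist, state = policy(obs.unsqueeze(0), state)
        log_probs.append(dist.log_prob(action.unsqueeze(0)))

    # Discounted return from each step to the end of the episode.
    returns, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    returns.reverse()

    return -(torch.cat(log_probs) * torch.tensor(returns)).sum()


# Usage with random stand-in data instead of a real environment.
torch.manual_seed(0)
policy = RecurrentPolicy(obs_dim=4, hidden_dim=8, n_actions=2)
obs = torch.randn(5, 4)               # 5 steps of 4-dimensional observations
acts = torch.randint(0, 2, (5,))      # actions taken at each step
loss = episode_loss(policy, obs, acts, [1.0] * 5)
loss.backward()                       # gradients flow back through time
```

Because the loss is built from the whole episode before `backward()` is called, gradients propagate through the recurrent state across all steps. For long episodes one would typically truncate this by detaching the state every fixed number of steps.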