My REINFORCE model is not learning

Hi,

I tried to apply same concept in pong game but by directly scaling the gradients way. It does not learn.
Please check this: Implementing reinforce using gradient scaling
my implementation

Thanks