Correct way to do backpropagation through time?

riccardosamperna · August 13, 2018, 4:59pm

I have a similar issue in this post.

I followed the pseudocode for the non-truncated BPTT in this conversation, the network trains but I have the feeling that the gradient is not flowing through time. I posted my training code for the network.

Can someone give some tips?