Gradient of hidden state in TBPTT

Hi! Unfortunately, I ended up abandoning my attempt to implement TBPTT. Just in case you missed it, here is a more active thread on the topic that you may find interesting:Implementing Truncated Backpropagation Through Time - #7 by riccardosamperna