Could anyone help me how I can train a model in Pytorch with a hidden layer for all the sequences that just one time is initialized with zero at the first forward pass and then not be reseted to zero for each sequence? A simple code would be highly appreciated!!