Neural Language Processing with just one time initialization

Aliaska · March 18, 2021, 12:45pm

Hi everyone,

Could anyone help me train a model in PyTorch whose hidden state is initialized to zero only once, at the first forward pass, and is then carried across all sequences instead of being reset to zero for each one? A simple code example would be highly appreciated!

Dwight_Foster · March 18, 2021, 1:44pm

You can have the forward function return the hidden state along with the output, and then pass that hidden state back into the function on every call.

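# Initialize the hidden state once, outside the training loop.
def forward(self, input, hidden):
    # Forward pass here; self.rnn is a placeholder for your recurrent layer.
    output, hidden = self.rnn(input, hidden)
    return output, hidden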

Then, on each call, you can just pass the returned hidden state back into the forward function.
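Here is a minimal sketch of that pattern. It assumes a single-layer nn.GRU (the thread does not say which recurrent layer is in use), and the class name CarryOverRNN, the init_hidden helper, and all layer sizes are made up for illustration. The hidden state is created as zeros exactly once and then the returned value is fed back in for every following sequence.

```python
import torch
import torch.nn as nn


class CarryOverRNN(nn.Module):  # hypothetical name, not from the thread
    def __init__(self, input_size=8, hidden_size=16, output_size=4):
        super().__init__()
        self.hidden_size = hidden_size
        self.gru = nn.GRU(input_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, output_size)

    def init_hidden(self, batch_size):
        # Shape expected by nn.GRU: (num_layers, batch, hidden_size).
        return torch.zeros(1, batch_size, self.hidden_size)

    def forward(self, input, hidden):
        # The layer returns the per-step outputs and the updated hidden state.
        out, hidden = self.gru(input, hidden)
        output = self.fc(out[:, -1, :])  # predict from the last time step
        return output, hidden


torch.manual_seed(0)
model = CarryOverRNN()

# Zero initialization happens once, before any sequence is processed.
hidden = model.init_hidden(batch_size=2)

# Made-up data: three sequences of shape (batch=2, seq_len=5, features=8).
sequences = [torch.randn(2, 5, 8) for _ in range(3)]

with torch.no_grad():
    for seq in sequences:
        # Feed the previous hidden state in and keep the new one.
        output, hidden = model(seq, hidden)
```

Note that the batch size has to stay the same from one call to the next, because the hidden state's shape depends on it.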

Aliaska · March 18, 2021, 1:50pm

Thanks, so this way the hidden state would be treated as an input?

Dwight_Foster · March 18, 2021, 1:51pm

Yes. You should initialize it outside of your epoch loop, like this:

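hidden = init_hidden()  # placeholder: your hidden initialization function or variable
for e in range(epochs):
    output, hidden = model(input, hidden)  # reuse the returned hidden state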
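A fuller training loop might look like the sketch below. The layer choice (nn.RNN plus a linear head), the random data, the loss, and the optimizer settings are all assumptions for illustration, not something stated in this thread. One detail matters when you train this way: detach the hidden state after each step. Otherwise the next backward() call tries to backpropagate through the previous batch's graph, which has already been freed, and PyTorch raises a RuntimeError.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Assumed model pieces: a plain RNN layer and a linear output head.
rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)
head = nn.Linear(16, 1)
params = list(rnn.parameters()) + list(head.parameters())
optimizer = torch.optim.SGD(params, lr=0.01)
loss_fn = nn.MSELoss()

# Made-up data: 4 batches of inputs (batch=2, seq_len=5, features=8)
# with targets of shape (batch=2, 1).
data = [(torch.randn(2, 5, 8), torch.randn(2, 1)) for _ in range(4)]
epochs = 2

# Initialized to zero once, before the epoch loop, and never reset.
hidden = torch.zeros(1, 2, 16)  # (num_layers, batch, hidden_size)

for e in range(epochs):
    for inputs, targets in data:
        optimizer.zero_grad()
        out, hidden = rnn(inputs, hidden)
        loss = loss_fn(head(out[:, -1, :]), targets)
        loss.backward()
        optimizer.step()
        # Keep the values but cut the graph, so the next backward pass
        # stops at this batch boundary.
        hidden = hidden.detach()
```

Detaching keeps the numerical state flowing from sequence to sequence while limiting gradient computation to the current batch, which is the usual truncated backpropagation through time setup.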

Aliaska · March 18, 2021, 1:54pm

Super clear, thanks, I hope it will be resolved