RNN Pack Padded Sequence Hidden State

Rafael_R · October 21, 2019, 9:52pm

Hi,

When using the pack padded sequence, what should be the final lstm state that is passed to the decoder?

Should it be hidden or output?

packed = torch.nn.utils.rnn.pack_padded_sequence(embedded, input_lengths, batch_first=True)
output, hidden = self.text_LSTM(packed, None)
output, _ = torch.nn.utils.rnn.pad_packed_sequence(output, batch_first=True)

Cheers!

InnovArul · October 22, 2019, 12:57am

The output contains the output of LSTM for all timesteps for all sequences.

The hidden has two components.

The output of last timestep (of the individual sequences).
The cell state after last timestep

In my understanding, you may want to pass the first component of the hidden variable (hidden[0]) to the decoder, as it represents the encoding of the whole sentence after having seen all the words in the sentence. Again, its a design choice about your network.

Rafael_R · October 22, 2019, 5:36am

thanks @InnovArul at some places like here i see output[:,-1,:] passed, are hidden[0] and output[:, -1,:] the same?

InnovArul · October 22, 2019, 12:08pm

Yes, they are same. In addition, output contains the output from all the timesteps, but hidden[0] has only the output of the last timestep.