Why isn't pad_packed_sequence sufficient ? Why do we need to pad by ourselves when we supply the sequence lengths?

dans · August 2, 2018, 12:26am

Why do we need to pad the input for variable length sequences for lstm when there is a pack_padded_sequence function that essentially tells the lstm to ignore the padded portion? Why isn’t the pack_padded_sequence function with the sequence lengths sufficient for training via mini batches? Why do we need to pre-pad the input ?

Brando_Miranda · June 14, 2019, 7:21pm

Crossposted: https://www.quora.com/unanswered/Why-do-we-need-to-pad-sequences-by-ourselves-in-Pytorch-when-we-supply-the-sequence-lengths-Why-isn-t-pad_packed_sequence-sufficient

hopefully we’ll get an answer some day.

cerisara · December 29, 2019, 9:18am

It’s because every row in a tensor has to have the same dimension. There’s no other way, as pytorch does not support python lists for this method.