Recently, I found
pad_packed_sequence for RNN modules. But I am not sure when these functions are useful.
Q1. To use
pack_padded_sequence, sorting tensors by length is needed for every mini-batch. This is cost. On the other hand, I heard
pack_padded_sequence skip calculation of padded elements. Compared them, skipping calculation is much beneficial? The below link is an example of
pack_sequence looks similar to
pack_sequence doesn’t need to use padded data. But what kind of situations can we use this function?