My task is order-sensitive, so I don’t want to sort my mini-batch by sequence length
just to use the pack_padded_sequence function.
I just noticed that the output of the LSTM differs depending on whether or not the input
goes through pack_padded_sequence first.
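
A minimal sketch of such a comparison is below. The sizes, seed, and variable names are made up for illustration and are not taken from the real model. It also assumes a PyTorch version where pack_padded_sequence accepts enforce_sorted=False, which lets the batch stay in its original order.

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

torch.manual_seed(0)

# Hypothetical toy sizes, chosen only for illustration.
batch_size, max_len, input_size, hidden_size = 3, 5, 4, 6
lengths = torch.tensor([2, 5, 3])  # deliberately NOT sorted by length

lstm = nn.LSTM(input_size, hidden_size, batch_first=True)

# Zero-padded batch: positions past each sequence's length are padding.
x = torch.randn(batch_size, max_len, input_size)
for i, n in enumerate(lengths.tolist()):
    x[i, n:] = 0.0

# (a) Run the LSTM directly on the padded tensor.
out_padded, (h_padded, _) = lstm(x)

# (b) Pack first; enforce_sorted=False keeps the original batch order.
packed = pack_padded_sequence(x, lengths, batch_first=True, enforce_sorted=False)
out_packed, (h_packed, _) = lstm(packed)
out_unpacked, _ = pad_packed_sequence(out_packed, batch_first=True, total_length=max_len)

# Compare the two runs sequence by sequence.
results = []
for i, n in enumerate(lengths.tolist()):
    same_valid = torch.allclose(out_padded[i, :n], out_unpacked[i, :n], atol=1e-5)
    pad_is_zero = bool((out_unpacked[i, n:] == 0).all())
    same_final = torch.allclose(h_padded[0, i], h_packed[0, i], atol=1e-5)
    results.append((same_valid, pad_is_zero, same_final))
    print(f"seq {i} (len {n}): valid steps match={same_valid}, "
          f"padded steps are zero={pad_is_zero}, final hidden matches={same_final}")
```

In a sketch like this, the two runs should agree on the real time steps. The differences show up at the padded positions, which come back as zeros after unpacking, and in the final hidden state of the shorter sequences, because the unpacked LSTM keeps stepping over the padding.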
Could anyone explain this result?