I have the following network:
input = torch.nn.utils.rnn.pack_padded_sequence(input,sequenceLengths,batch_first=True)
output, hidden = GRU(input,hiddenLayer)
output, seqLengths = torch.nn.utils.rnn.pad_packed_sequence(output,batch_first=True)
How do I do max pooling over output as sequence length is variable? Assuming output is of dimension N x L x hiddenLayerDim. N = batchSize, L=length of longest sequence.
I would like to do max pooling along the length (L) dim (=1).