How to turn list of varying length tensor into a tensor

Shihan_su · March 26, 2017, 5:24am

Hi, I am currently trying to do batch training on RNN.
The first step is to pad the batch of sequence using pack_padded_sequence().
But the function seems to take Variable as input, which means it need to be a tensor.

Currently, my input format is a list of tensors with varying length.
When I try to turn that list into a tensor, it throws me the error: ‘FloatTensor’ object does not support indexing

It seems that I cannot create tensor with varying length on any dimension.

My goal is to created a batch of padded sequence from list of tensors with varying length.
What may be the correct way of implementation ?

smth · March 26, 2017, 7:30pm

The code here will show you how to pad your tensors of different lengths into a single tensor used by pack_padded_sequence:

github.com

pytorch/pytorch/blob/master/test/test_nn.py#L1336-L1344


    self._test_Conv2d_naive_groups(torch.cuda.FloatTensor)


def test_batchnorm_eval(self):
    self._test_batchnorm_eval()


@unittest.skipIf(not TEST_CUDA, "CUDA unavailable")
def test_batchnorm_eval_cuda(self):
    self._test_batchnorm_eval(torch.cuda.FloatTensor)

Shihan_su · March 27, 2017, 2:06am

If I understand correctly, that means I need to go through some procedure to pad the sequence myself.
In this case, what are the purpose of pack_padded_seqence() and pad_packed_sequence() functions ?
I thought they were used somehow to pad sequence with varying length automatically.

ZeweiChu · March 28, 2017, 1:23pm

These two functions help you pack (already) padded sequences and pad (already) packed sequences. I think the purpose is not to help you pad sequences, but to save some spaces. Check out the following example:

import itertools
import torch
import torch.nn as nn
import torch.nn.utils.rnn as rnn_utils
from torch.autograd import Variable, gradcheck
def test_pack_padded_sequence():
def pad(tensor, length):
return torch.cat([tensor, tensor.new(length - tensor.size(0), *tensor.size()[1:]).zero_()])
lengths = [2,1]#[10, 8, 4, 2, 2, 2, 1]
max_length = lengths[0]
batch_sizes = [sum(map(bool, filter(lambda x: x >= i, lengths))) for i in range(1, max_length + 1)]
offset = 0
padded = torch.cat([pad(i * 100 + torch.range(1, 5 * l).view(l, 1, 5), max_length) for i, l in enumerate(lengths, 1)], 1)
padded = Variable(padded, requires_grad=True)
expected_data = [[torch.range(1, 5) + i * 100 for i in range(batch_size)] for batch_size in batch_sizes]
expected_data = list(itertools.chain.from_iterable(expected_data))
expected_data = torch.cat(expected_data)
for batch_first in (True, False):
src = padded
if batch_first:
src = src.transpose(0, 1)
print(“SRCSRCSRC”)
print(src)
print(“LENGTHS”)
print(lengths)
packed = rnn_utils.pack_padded_sequence(src, lengths, batch_first=batch_first)
print(“PACKED”)
print(packed)
unpacked, unpacked_len = rnn_utils.pad_packed_sequence(packed, batch_first=batch_first)
print(“UNPACKED”)
print(unpacked)
print(“UNPACKED_LEN”)
print(unpacked_len)
test_pack_padded_sequence()

apaszke · March 28, 2017, 1:25pm

And apart from saving space, it allows to use nn.LSTM with variable length sequences, which can speed up your model quite a lot if you’re using bidirectional RNNs, or using the last hidden states for some purposes. We don’t have a function that takes in a list of tensors and pad then, but it sounds useful. Can you open a feature request?

ecolss · April 12, 2017, 7:33am

Any detailed examples on these 2 functions, pack_padded pad_packed? Seems confusing to me.

I thought, pack_padded would turn an already padded tensor (i.e. with each row having the same length, suppose 2d tensors here), into another one with each row having different length (just like sparse matrix?)

smth · April 12, 2017, 3:08pm

@ecolss see the examples in the release notes here: https://github.com/pytorch/pytorch/releases/tag/v0.1.10

ecolss · April 13, 2017, 1:39am

As illustrated in the release notes, PackedSequence holds data and a list of sequence sizes of a packed sequence batch.

Then why would we have to first pad the seq to equal length manually and then pass it to pack_padded func to generate this PackedSequence object? Wouldn’t it be easier to just construct the object directly from the original seq without manual padding?

lgelderloos · May 29, 2017, 1:18pm

@ecolss , @apaszke suggested making a feature request for this as it seems useful. was wondering if you followed up on it?

ecolss · May 30, 2017, 4:05am

I’m not following up on this, but I agree with you on this.

nilbot · September 21, 2017, 4:28pm

just to give a consistent link for those lines, you can press ‘y’ in github and get the link.
here is the updated link:

github.com

pytorch/pytorch/blob/892940b45a20d94450661a3108f4723a0f4815b1/test/test_nn.py#L1914-L1923


def test_pack_padded_sequence(self):
    def pad(tensor, length):
        return torch.cat([tensor, tensor.new(length - tensor.size(0), *tensor.size()[1:]).zero_()])
    lengths = [10, 8, 4, 2, 2, 2, 1]
    max_length = lengths[0]
    batch_sizes = [sum(map(bool, filter(lambda x: x >= i, lengths))) for i in range(1, max_length + 1)]
    offset = 0
    padded = torch.cat([pad(i * 100 + torch.arange(1, 5 * l + 1).view(l, 1, 5), max_length)
                        for i, l in enumerate(lengths, 1)], 1)
    padded = Variable(padded, requires_grad=True)

RylanSchaeffer · July 3, 2019, 8:09pm

For anyone wondering if PyTorch has created a function that takes a list of tensors and pads them to be the same length, the following function does the trick:

https://pytorch.org/docs/stable/nn.html#torch.nn.utils.rnn.pad_sequence

Example:

>>> import torch
>>> l = [torch.Tensor([1., 2.]), torch.Tensor([3.])]
>>> torch.nn.utils.rnn.pad_sequence(l, batch_first=True, padding_value=0)
tensor([[1., 2.],
        [3., 0.]])