import torch

class TrainDataset(torch.utils.data.Dataset):
    ...
    def __getitem__(self, index):
        ...
        # suppose we return this image
        img = torch.randn(3, 240, 240)
        # here other_info has a fixed number of channels
        channel_tmp = 10
        other_info = torch.randn(channel_tmp, 240, 240)
        return img, other_info
However, in my case, I need something like this instead:
import numpy as np
import torch

class TrainDataset(torch.utils.data.Dataset):
    ...
    def __getitem__(self, index):
        ...
        # suppose we return this image
        img = torch.randn(3, 240, 240)
        # the number of channels of other_info now varies from sample to sample
        channel_tmp = np.random.randint(5, 10)
        other_info = torch.randn(channel_tmp, 240, 240)
        return img, other_info
Of course, this snippet does not work with the default DataLoader, because the default collate function calls return torch.stack(batch, 0, out=out) in torch/utils/data/_utils/collate.py, and torch.stack requires every tensor in the batch to have the same shape.
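For reference, the stack call itself is what fails once the channel counts differ between samples; a minimal illustration (the two channel counts below are arbitrary examples):

a = torch.randn(6, 240, 240)
b = torch.randn(9, 240, 240)
# RuntimeError: stack expects each tensor to be equal size
batched = torch.stack([a, b], 0)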
Edit:

- The channel_tmp varies a lot (from a very small to a very large number), so if I always set channel_tmp to the largest possible value, it goes OOM as soon as the batch size is 4 or 8.
- In fact, for this specific variable other_info, what I want is that the data returned for each sample (from each worker thread) gets stacked along the channel_tmp dimension, as in the sketch below.
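To make the second point concrete, here is a rough, untested sketch of the behaviour I have in mind as a custom collate_fn (the name my_collate, the list comprehensions, and the choice of dim=0 are just my placeholders):

def my_collate(batch):
    # batch is a list of (img, other_info) tuples returned by __getitem__
    imgs = torch.stack([img for img, _ in batch], 0)         # (B, 3, 240, 240)
    # concatenate instead of stacking, since channel_tmp differs per sample
    other_infos = torch.cat([info for _, info in batch], 0)  # (sum of channel_tmp, 240, 240)
    return imgs, other_infos

loader = torch.utils.data.DataLoader(TrainDataset(...), batch_size=4, collate_fn=my_collate)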
Is there any workaround to this?