Train 3D Networks with PyTorch

Hi all,

Can I train 3D networks (such as C3D, V2V) with Pytorch?
How can I generate a 5D tensor (batch, channel, length, height, width) from image sequences for the network?
Could you please show me some examples?

Thank you in advance.

Yes, we support that. You can find modules like Conv3d and pooling layers for volumes. As for generating batches: write your own Dataset that loads and returns 4D elements from __getitem__, then use a DataLoader to load the data (you can give it a batch_size argument, and it will batch the 4D elements into a 5D tensor for you).
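A minimal sketch of such a Dataset (the random tensors, clip count, and frame sizes here are placeholders; in practice you would load real image sequences from disk):

```python
import torch
from torch.utils.data import Dataset, DataLoader

class VideoClipDataset(Dataset):
    """Toy dataset: each item is one clip as a 4D tensor (channel, length, height, width)."""

    def __init__(self, num_clips=8, channels=3, length=16, height=112, width=112):
        # Random tensors stand in for real frames in this sketch.
        self.clips = torch.randn(num_clips, channels, length, height, width)

    def __len__(self):
        return self.clips.size(0)

    def __getitem__(self, idx):
        # Return a single 4D element (C, L, H, W) -- no batch dimension.
        return self.clips[idx]

loader = DataLoader(VideoClipDataset(), batch_size=4)
batch = next(iter(loader))
print(batch.shape)  # torch.Size([4, 3, 16, 112, 112]) -- 5D (N, C, L, H, W)
```

The DataLoader stacks the 4D items along a new leading batch dimension, which is exactly the 5D layout the 3D conv layers expect.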

Thanks. Do you support Deconv3D?

Yes. Look for ConvTranspose3d.
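A quick sketch of ConvTranspose3d in use (the channel counts and input sizes are arbitrary; output sizes depend on kernel_size, stride, padding, and output_padding):

```python
import torch
import torch.nn as nn

# ConvTranspose3d upsamples volumetric feature maps; with stride=2,
# kernel_size=4, padding=1 each spatial/temporal dimension doubles.
deconv = nn.ConvTranspose3d(in_channels=64, out_channels=32,
                            kernel_size=4, stride=2, padding=1)

x = torch.randn(2, 64, 8, 14, 14)   # 5D input: (N, C, D, H, W)
y = deconv(x)
print(y.shape)  # torch.Size([2, 32, 16, 28, 28])
```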

Are the returned 4D elements (c, d, h, w)?

The DataLoader's output is a 5D tensor of shape (N, c, d, h, w).

You need to return 4D elements from the dataset, so they will be concatenated into 5D batches. Also, nn only supports batch mode, so you'll be using 5D tensors throughout the network.
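For example, a small Conv3d/pooling stack consumes exactly such 5D batches (the layer sizes below are arbitrary, chosen only to show the shapes):

```python
import torch
import torch.nn as nn

# All nn 3D layers operate on 5D batches (N, C, D, H, W).
conv = nn.Conv3d(in_channels=3, out_channels=16, kernel_size=3, padding=1)
pool = nn.MaxPool3d(kernel_size=2)

clip_batch = torch.randn(4, 3, 16, 112, 112)  # a 5D batch of 4 clips
out = pool(conv(clip_batch))                  # conv keeps size, pool halves D/H/W
print(out.shape)  # torch.Size([4, 16, 8, 56, 56])
```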

Sorry, I meant: what dimensions should the elements returned by __getitem__ have when I write the Dataset?

As I said, __getitem__ should return 4D elements, because the DataLoader will concatenate them into a 5D batch.