Since nn.Upsample can also handle 2D tensors, what's the difference between nn.Upsample and nn.ConvTranspose2d? Thank you.
While nn.Upsample uses some interpolation technique, nn.ConvTranspose uses trainable filters to create your output (similar to vanilla conv layers).
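A minimal sketch of the difference (the shapes and hyperparameters here are illustrative assumptions): both layers can double the spatial size, but only nn.ConvTranspose2d has learnable weights.

```python
import torch
import torch.nn as nn

x = torch.randn(1, 3, 8, 8)  # (batch, channels, height, width)

# nn.Upsample interpolates; it has no learnable parameters.
up = nn.Upsample(scale_factor=2, mode="nearest")

# nn.ConvTranspose2d learns its filters, like a regular conv layer.
# kernel_size=2 with stride=2 doubles the spatial size here.
deconv = nn.ConvTranspose2d(in_channels=3, out_channels=3,
                            kernel_size=2, stride=2)

print(up(x).shape)      # torch.Size([1, 3, 16, 16])
print(deconv(x).shape)  # torch.Size([1, 3, 16, 16])
```

Both outputs have the same shape, but the deconv output depends on weights that are updated during training.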
Thank you for the information. Can you explain the pros and cons of these two functions? Does Upsample recover more context information, or does ConvTranspose?
Generally speaking, the ConvTranspose layer might learn some features, as it's using trainable parameters, while Upsample just interpolates.
The former approach would thus have more parameters (more capacity) and might therefore overfit easier.
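You can verify the parameter counts directly (the channel and kernel sizes below are arbitrary choices for illustration):

```python
import torch.nn as nn

up = nn.Upsample(scale_factor=2)
deconv = nn.ConvTranspose2d(64, 64, kernel_size=2, stride=2)

# nn.Upsample has no trainable parameters at all.
print(sum(p.numel() for p in up.parameters()))      # 0

# nn.ConvTranspose2d: 64*64*2*2 weights + 64 biases = 16448
print(sum(p.numel() for p in deconv.parameters()))  # 16448
```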
I can’t really tell which approach works better in which situation, as I’ve seen both methods used for certain use cases.
While it seems that ConvTranspose layers are preferred in GANs, I've seen some models using Upsample perform better for segmentation tasks. This is just my biased observation, so it's not a recommendation to choose one over the other.
You should try both approaches and see how your model performs.
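If you want the best of both worlds when experimenting, a pattern often seen in segmentation decoders (a sketch based on common practice, not something from this thread) is to interpolate first and then refine with a regular conv, so the block still has trainable filters:

```python
import torch
import torch.nn as nn

# Upsample-then-conv: interpolation handles the resizing, and the
# following conv provides the learnable parameters.
up_block = nn.Sequential(
    nn.Upsample(scale_factor=2, mode="bilinear", align_corners=False),
    nn.Conv2d(64, 32, kernel_size=3, padding=1),
)

x = torch.randn(1, 64, 16, 16)
print(up_block(x).shape)  # torch.Size([1, 32, 32, 32])
```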
Thank you, that helps a lot.