Hi,
I am trying to do semantic segmentation and thought of trying out torchvision.models.segmentation.fcn_resnet50. I just see down convolution in this model and there is no upsampling happening through conv transpose layers. Then, how does the segmentation output be of same size as of the input?
Thanks