I am using a Unet model (a simple encoder with average pooling and a decoder using ConvTranspose2d function) for image upsampling (super-resolution). A pair of images (input on the left and target on the middle) is used for training as shown below. Every other column from the target image is removed and zero-padded to come up with the input image.
@J_Johnson Yes, the output looks close to the target. I have uploaded the full-size validation target and output images at Unet — ImgBB. You can see the artifacts when you zoom in on the output.
@Soumya_Kundu, I am feeding the whole image. I will give reducing/patching the images a try and see if that works.