Size of Tensors in Autoencoders

Hi,
I was trying to create a denoising autoencoder, but while training it I got a size mismatch error, even though the clean and the pixelated images have the same size.
Here is the model architecture:

import torch
import torch.nn as nn

############### MODEL ################
class AutoEncoder(nn.Module):
  def __init__(self):
    super(AutoEncoder, self).__init__()
    self.encoder = nn.Sequential(
        nn.Conv2d(3, 64, (3,3)),
        nn.MaxPool2d((2,2)),
        nn.Conv2d(64, 32, (3,3)),
        nn.MaxPool2d((2,2)),
        nn.Conv2d(32, 16, (3,3)),
        nn.MaxPool2d((2,2)),
        nn.Conv2d(16, 8, (3,3)),
        nn.MaxPool2d((2,2)),
    )
    self.decoder = nn.Sequential(
        nn.Upsample((16,16)),
        nn.ConvTranspose2d(8, 16, (3,3)),
        nn.Upsample((32,32)),
        nn.ConvTranspose2d(16, 32, (3,3)),
        nn.Upsample((64,64)),
        nn.ConvTranspose2d(32, 64, (3,3)),
        nn.Upsample((128,128)),
        nn.ConvTranspose2d(64, 3, (3,3)),
    )

  def forward(self, xb):
    encoded = self.encoder(xb)
    return self.decoder(encoded)

The size of the images is 128x128.
Here is a link to the notebook as well:

@ptrblck, do you have any idea?

Your model works fine with an input of [batch_size, 3, 128, 128]:

import torch

model = AutoEncoder()
x = torch.randn(2, 3, 128, 128)
out = model(x)
print(out.shape)
> torch.Size([2, 3, 130, 130])

so I guess the shape of your model output doesn’t match the target shape.
If that’s the case, you would have to change the layer setup so that the model output and the target have the same spatial size.
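
If it helps with debugging: reusing the model and imports from above, you could print the shape after every layer to see where the spatial size starts to drift (just a quick sketch, iterating over the two nn.Sequential blocks):

x = torch.randn(2, 3, 128, 128)
# walk through encoder and decoder layer by layer and print the output shapes
for name, seq in [("encoder", model.encoder), ("decoder", model.decoder)]:
    for i, layer in enumerate(seq):
        x = layer(x)
        print(f"{name}[{i}] {layer.__class__.__name__}: {tuple(x.shape)}")

The last printed shape will again be (2, 3, 130, 130): the final ConvTranspose2d adds 2 pixels on top of the 128x128 coming from the last Upsample layer.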
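
As one possible fix (just a sketch that keeps your layer structure; the class name AutoEncoderPadded is only for illustration): adding padding=1 to every 3x3 conv and transposed conv makes them keep the spatial size, so only the MaxPool2d and Upsample layers change it and the output ends up as 128x128 again:

import torch
import torch.nn as nn

class AutoEncoderPadded(nn.Module):
  def __init__(self):
    super().__init__()
    self.encoder = nn.Sequential(
        nn.Conv2d(3, 64, (3,3), padding=1),   # 128 -> 128
        nn.MaxPool2d((2,2)),                  # 128 -> 64
        nn.Conv2d(64, 32, (3,3), padding=1),  # 64 -> 64
        nn.MaxPool2d((2,2)),                  # 64 -> 32
        nn.Conv2d(32, 16, (3,3), padding=1),  # 32 -> 32
        nn.MaxPool2d((2,2)),                  # 32 -> 16
        nn.Conv2d(16, 8, (3,3), padding=1),   # 16 -> 16
        nn.MaxPool2d((2,2)),                  # 16 -> 8
    )
    self.decoder = nn.Sequential(
        nn.Upsample((16,16)),
        nn.ConvTranspose2d(8, 16, (3,3), padding=1),   # 16 -> 16
        nn.Upsample((32,32)),
        nn.ConvTranspose2d(16, 32, (3,3), padding=1),  # 32 -> 32
        nn.Upsample((64,64)),
        nn.ConvTranspose2d(32, 64, (3,3), padding=1),  # 64 -> 64
        nn.Upsample((128,128)),
        nn.ConvTranspose2d(64, 3, (3,3), padding=1),   # 128 -> 128
    )

  def forward(self, xb):
    return self.decoder(self.encoder(xb))

out = AutoEncoderPadded()(torch.randn(2, 3, 128, 128))
print(out.shape)
> torch.Size([2, 3, 128, 128])

Alternatively, you could keep the model as it is and crop or interpolate the output (or the target) to the same spatial size before computing the loss.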