Downsampling at resnet

Ali_Mirzaeyan · March 5, 2019, 11:53pm

Hi,
the following picture is a snippet of resnet 18 structure. I got confused about the dimensions. I thought the input size of a layer should be the same as the output size of the previous layer. I wonder those highlighted numbers, shouldn’t have the same value?

chenglu · March 6, 2019, 1:10am

you should take a look at the logic of the forward function, the structure of layers do not represent the flow of tensors.

Ali_Mirzaeyan · March 6, 2019, 1:26am

this is the forward path:

def forward(self, x):
    x = self.conv1(x)
    x = self.bn1(x)
    x = self.relu(x)
    x = self.maxpool(x)

    x = self.layer1(x)
    x = self.layer2(x)
    x = self.layer3(x)
    x = self.layer4(x)

    x = self.avgpool(x)
    x = x.view(x.size(0), -1)
    x = self.fc(x)

    return x

it seems the snippet(layer4) will run as it is.

chenglu · March 6, 2019, 3:45am

This is mine resnet18 output, did you use the resnet18 from torchvision or some other implementation?

Ali_Mirzaeyan · March 6, 2019, 12:27pm

I took it from here:

github.com

pytorch/vision/blob/master/torchvision/models/resnet.py

import torch.nn as nn
import torch.utils.model_zoo as model_zoo


__all__ = ['ResNet', 'resnet18', 'resnet34', 'resnet50', 'resnet101',
           'resnet152']


model_urls = {
    'resnet18': 'https://download.pytorch.org/models/resnet18-5c106cde.pth',
    'resnet34': 'https://download.pytorch.org/models/resnet34-333f7ec4.pth',
    'resnet50': 'https://download.pytorch.org/models/resnet50-19c8e357.pth',
    'resnet101': 'https://download.pytorch.org/models/resnet101-5d3b4d8f.pth',
    'resnet152': 'https://download.pytorch.org/models/resnet152-b121ed2d.pth',
}


def conv3x3(in_planes, out_planes, stride=1):
    """3x3 convolution with padding"""
    return nn.Conv2d(in_planes, out_planes, kernel_size=3, stride=stride,

This file has been truncated. show original

Ali_Mirzaeyan · March 6, 2019, 12:46pm

apparently, I mistakenly initialize resnet with Bottleneck instead of BasicBlock. However, the problem is still there:

ptrblck · March 6, 2019, 1:00pm

As @chenglu said, the forward logic might differ from the order of the stored modules.
If you look at this line of code, you’ll see, that self.downsample is appiled on x, which differs from the output of self.bn2.