A question about `padding` in `nn.MaxPool2d`

Ren_Pang · February 25, 2022, 7:11am

According to Google’s pytorch implementation of Big Data Transfer, there is subtle difference between the following 2 approaches. Could anyone explain the difference? Is it some different strategy for boundary pixels?

What’s the purpose of spliting padding parameter from nn.MaxPool2d and making it a separate nn.Pad layer before the pooling?

github.com

google-research/big_transfer/blob/140de6e704fd8d61f3e5ea20ffde130b7d5fd065/bit_pytorch/models.py#L121-L124

      
        
            ('pad', nn.ConstantPad2d(1, 0)),
            ('pool', nn.MaxPool2d(kernel_size=3, stride=2, padding=0)),
            # The following is subtly not the same!
            # ('pool', nn.MaxPool2d(kernel_size=3, stride=2, padding=1)),