Residual connection implementation

1c9d70faac66efabd051 · December 22, 2018, 7:41am

Now i am studying “swish net” that model for audio segmentation.

In that paper, they used strided convolution & residual net. Follw image is from [1812.00149] SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation.

after through stride=2 conv layer, its output length will be half of the input length.

Here, my question is…
how can merger output with input(residual connection) even their array dimension is mismatched?

G.A is just gated activation function, so it doesnt affect on the output dimension.

vaishnavm217 · December 22, 2018, 8:20am

You have to use any linear transformation. Resnet (which has residual connections) has linear transformations to handle that.

github.com

pytorch/vision/blob/038105ff3a18f42903903d2a7714f2e083932a6a/torchvision/models/resnet.py#L133-L139


def _make_layer(self, block, planes, blocks, stride=1):
    downsample = None
    if stride != 1 or self.inplanes != planes * block.expansion:
        downsample = nn.Sequential(
            conv1x1(self.inplanes, planes * block.expansion, stride),
            nn.BatchNorm2d(planes * block.expansion),
        )