Like to see the exact network architecture of resnet v1.5.
I see two references:
“where a stride 2 is used on the 3x3 conv rather than the first 1x1 in the bottleneck”
" Strided convolution : When using the bottleneck architecture, we use stride 2 in the 3x3 convolution, instead of the first 1x1 convolution."
I have two questions.
From the table in the resnet paper: https://arxiv.org/pdf/1512.03385.pdf
“Downsampling is performed by conv3 1, conv4 1, and conv5 1 with a stride of 2.”,
In v1.5, these 1x1 convolutions of stride 2 are removed and the following 3x3 is made into stride 2? (That would make sense) So, just 3 1x1 convolution blocks are removed and the 3 3x3 convolutions are made into stride 2?
Is this all the changes?