No bias in the pretrained state dictionary of ResNet18

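In general, each conv layer in ResNet is followed by a batch norm layer, which subtracts the per-channel mean. That mean subtraction cancels any constant bias the conv layer would add, so the conv layers are created without a bias term (even if a bias were used, its gradient would be essentially 0 during training). The batch norm layer's own learnable shift plays the role of the bias instead, which is why the state dictionary contains bias entries for the batch norm layers but not for the conv layers.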
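To see the cancellation concretely, here is a minimal sketch in plain Python (standard library only, not PyTorch). The `batch_norm` helper, the activation values, and the bias value are all made up for illustration; the helper only mimics the normalization step of batch norm for a single channel, without the learnable scale and shift.

```python
import statistics


def batch_norm(values, eps=1e-5):
    """Normalize one channel's activations to zero mean and unit variance.

    Hypothetical helper: mimics only the normalization step of batch norm
    (batch mean and biased variance), not the learnable scale/shift.
    """
    mean = statistics.fmean(values)
    var = statistics.pvariance(values, mu=mean)
    return [(v - mean) / (var + eps) ** 0.5 for v in values]


# Made-up conv outputs for one channel across a batch of four samples.
conv_out = [0.5, -1.2, 3.0, 0.7]
bias = 2.5  # hypothetical per-channel conv bias

without_bias = batch_norm(conv_out)
with_bias = batch_norm([v + bias for v in conv_out])

# Adding a constant shifts the mean by the same constant and leaves the
# variance unchanged, so the normalized outputs agree up to rounding error.
max_diff = max(abs(a - b) for a, b in zip(without_bias, with_bias))
print(max_diff)
```

Since the output after normalization does not depend on `bias`, the loss does not depend on it either, which is why its gradient would vanish.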
