In general, ResNet’s conv layers are followed by batch norm layers which perform mean
subtraction. This mean subtraction step will effectively remove the influence of bias. Hence, bias is not used in conv layers (even if bias is used, their gradients would be almost 0).
1 Like