What does affine do in nn.Conv2d(...)?

filleloup · November 1, 2020, 2:43pm

Hi!
I would appreciate it if you could give me a detailed explanation of what affine does in nn.Conv2d() or nn.BatchNorm2d().
Thank you!

googlebot · November 1, 2020, 3:51pm

It is just scale & shift: y = x*w+b, for batch norm it is done channelwise, i.e.: x[B,C,H,W] * w[1,C,1,1]+b[1,C,1,1].

Conv operations don’t have this functionality, as kernel and bias parameters implicitly do scale & shift.

filleloup · November 1, 2020, 3:52pm

Thank you for responding!