same question in
momentum, weight decay affect updating params with requires_grad=False
requires_grad=False