After replacing BatchNorm with GroupNorm via ModuleValidator.fix(), the gradients of the parameters are None when I read them from model.named_parameters(). The grad_sample attributes of the parameters are also None, so gradients are not flowing to the replaced GroupNorm weights during the backward pass.
Could you explain what ModuleValidator.fix() does? It seems to be a custom method.