Once training is done and the SWA model is computed, how do you update the batch norm statistics when using DDP?
Do you still just call torch.optim.swa_utils.update_bn(train_loader, swa_model), and are the statistics updated consistently across the different ranks?
Since each rank holds a full copy of the model, I believe you can just call update_bn on each rank, the same way you would for a single (non-distributed) model, and it should work.
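Here is a minimal sketch of what that could look like. It assumes a standard one-process-per-GPU DDP setup where `model`, `train_loader`, and `device` already exist on each rank, and that `swa_model` is the unwrapped averaged module (the names and the `finalize_swa` helper are just illustrative, not an official recipe):

```python
import torch
from torch.optim.swa_utils import AveragedModel, update_bn

def finalize_swa(model: torch.nn.Module, train_loader, device: torch.device):
    # Wrap the already-trained (unwrapped) module; during training you would
    # have called swa_model.update_parameters(model) after each epoch/step.
    swa_model = AveragedModel(model).to(device)

    # Every rank holds an identical copy of the averaged weights, so each rank
    # simply recomputes the BatchNorm running statistics by forwarding its own
    # portion of the training data through the SWA model.
    update_bn(train_loader, swa_model, device=device)
    return swa_model
```

Note that if your train_loader uses a DistributedSampler, each rank only sees its own shard of the data, so the recomputed statistics may differ slightly between ranks; in practice this is usually close enough, but you could also run update_bn over the full dataset on one rank and broadcast the buffers if you need them to match exactly.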