Hi, I have a question about saving the optimizer state in a multi-GPU training scenario:
checkpoint = {
    'model': model.module.state_dict(),
    'optimizer': optimizer.module.state_dict(),
}
The code above does not work. If I instead save the optimizer state the same way as with a single GPU:
checkpoint = {
    'model': model.module.state_dict(),
    'optimizer': optimizer.state_dict(),
}
Is that correct?
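
A minimal sketch of the second variant, assuming the model is wrapped in torch.nn.DataParallel. The Linear layer, the SGD settings, and the in-memory buffer are placeholders chosen for illustration, not the actual training setup. Only the model is wrapped, so the optimizer is still a plain torch.optim object with no .module attribute, which would explain why the first variant fails.

```python
import io

import torch
import torch.nn as nn

# Placeholder setup (assumption): a toy model wrapped in DataParallel.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = nn.DataParallel(nn.Linear(4, 2).to(device))
optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)

# One training step so the optimizer has state (momentum buffers) to save.
loss = model(torch.randn(8, 4, device=device)).sum()
loss.backward()
optimizer.step()

# Only the model is wrapped; the optimizer is not.
optimizer_is_wrapped = hasattr(optimizer, "module")

checkpoint = {
    # .module unwraps DataParallel, so keys carry no "module." prefix.
    "model": model.module.state_dict(),
    # The optimizer is saved exactly as in single-GPU training.
    "optimizer": optimizer.state_dict(),
}

# Round-trip through an in-memory buffer instead of a file on disk.
buffer = io.BytesIO()
torch.save(checkpoint, buffer)
buffer.seek(0)
restored = torch.load(buffer, map_location=device)

# Restore into a fresh, unwrapped model and a fresh optimizer.
new_model = nn.Linear(4, 2).to(device)
new_model.load_state_dict(restored["model"])
new_optimizer = torch.optim.SGD(new_model.parameters(), lr=0.1, momentum=0.9)
new_optimizer.load_state_dict(restored["optimizer"])
```

Saving model.module.state_dict() rather than model.state_dict() keeps the checkpoint loadable into an unwrapped model, as the last few lines show.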