It seems better to save/load the state dict of the wrapped “module” instance inside nn.DataParallel, rather than the state dict of the nn.DataParallel wrapper itself. But I’m not sure whether that’s a valid option. Is this the recommended way to do it?
import torch
from torchvision.models import resnet101

# Save the inner module's state dict (keys without the "module." prefix)
model = resnet101()
model = torch.nn.DataParallel(model)
torch.save(model.module.state_dict(), 'state')

# Load it back into another wrapped model via .module
model2 = resnet101()
model2 = torch.nn.DataParallel(model2)
model2.module.load_state_dict(torch.load('state'))
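For context, the practical difference is key naming: the wrapper registers the inner model as a submodule named "module", so the wrapper's own state dict prefixes every key with "module.", while model.module.state_dict() keeps the plain keys that a bare (unwrapped) model expects. A minimal sketch, using nn.Linear as a stand-in for resnet101:

```python
import torch
import torch.nn as nn

# Small stand-in model (hypothetical; the original uses resnet101)
dp = nn.DataParallel(nn.Linear(4, 2))

# Wrapper keys are prefixed; inner keys are plain
print(list(dp.state_dict().keys()))         # ['module.weight', 'module.bias']
print(list(dp.module.state_dict().keys()))  # ['weight', 'bias']

# The plain-key checkpoint loads directly into an unwrapped model
plain = nn.Linear(4, 2)
plain.load_state_dict(dp.module.state_dict())
```

So saving model.module.state_dict() produces a checkpoint that loads into either a wrapped model (via .module) or a bare one, which is why it is the more portable choice.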
I also just found that nn.DataParallel works fine even when no GPU is available. Would it be better to use nn.DataParallel all the time, with or without a GPU?
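On the CPU question: when no CUDA device is visible, the wrapper's device list is empty and its forward pass simply calls the wrapped module directly, so wrapping is harmless on a CPU-only machine. A small sketch that runs either way (the explicit .to(device) is only needed so the same snippet also works on a GPU box, since DataParallel expects the parameters on its first device):

```python
import torch
import torch.nn as nn

device = "cuda" if torch.cuda.is_available() else "cpu"

# Wrapping works on CPU-only machines; with no GPUs the forward
# pass falls through to the inner module unchanged.
model = nn.DataParallel(nn.Linear(4, 2)).to(device)
out = model(torch.randn(8, 4).to(device))
print(out.shape)  # torch.Size([8, 2])
```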