Model trained using DataParallel cannot be loaded for inference on 1 GPU

Thank you, Juan.
That’s refreshing to know, especially since these things are not reflected in any documentation.
If you don’t mind me asking, do you have any idea what the proper usage of distributed.DataParallel is?

For instance, here I explain how I’ve used it, and I still end up getting an error during execution.