Model parallelism for pretrained model

wasabi · October 27, 2022, 8:20pm

I’m trying to follow https://pytorch.org/tutorials/intermediate/model_parallel_tutorial.html for model parallelism, is there a way to make this process easy for pretrained models? For example, in this post, I can create a paralleled ModelParallelResNet50 from scratch. However, if I have a pretrained ResNet50, I cannot directly load model state into ModelParallelResNet50, since the latter has a different structure.

AlphaBetaGamma96 · October 27, 2022, 8:33pm

Hi @wasabi,

Perhaps using the strict=False argument might be a workaround for this error?