Load and freeze one model and train others

111357 · July 20, 2020, 3:30pm

I have a model A that including three submodels model1, model2, model3.
the model flow: model1 --> model2 --> model3
I have trained model1 in an independent project.
The question is how to use the pre-trained model1 when training the model A?

Now, I try to implement this as follow:
I load the checkpoint of model1 by model1.load_state_dict(torch.load(model1.pth)) and then set the requires_grad of the model1’s parameters as False?
Is it right?

Usama_Hasan · July 20, 2020, 3:48pm

That would do it as far as the freezing and loading model1 is concerned.

111357 · July 24, 2020, 12:46am

got it, thanks a lot.