Combining Trained Models in PyTorch

The gradients will be calculated for the parameters of all submodules as well as the final classifier, if you didn’t disable it via .requires_grad=False.
Also the state_dict() will return all parameters of all submodules and the final classifier.

1 Like