Combining Trained Models in PyTorch

ptrblck · May 19, 2020, 4:53pm

Gradients will be calculated for the parameters of all submodules as well as the final classifier, unless you disabled them by setting .requires_grad = False on those parameters.
Likewise, state_dict() will return the parameters (and buffers) of all submodules and the final classifier, including any frozen ones.
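Here is a small sketch you could run to check both points. The `Ensemble` class, the layer sizes, and the names `model_a`, `model_b`, and `classifier` are made up for illustration and are not taken from the original question; it assumes the combined model simply concatenates the submodule outputs before the classifier.

```python
import torch
import torch.nn as nn


class Ensemble(nn.Module):
    """Hypothetical wrapper: two trained submodules plus a final classifier."""

    def __init__(self, model_a, model_b, feat_dim, num_classes):
        super().__init__()
        # Assigning modules as attributes registers them as submodules
        self.model_a = model_a
        self.model_b = model_b
        self.classifier = nn.Linear(2 * feat_dim, num_classes)

    def forward(self, x):
        # Concatenate the submodule outputs along the feature dimension
        out = torch.cat([self.model_a(x), self.model_b(x)], dim=1)
        return self.classifier(out)


# Stand-ins for the already trained models (assumed sizes)
model_a = nn.Linear(10, 4)
model_b = nn.Linear(10, 4)

# Freeze model_a only; model_b and the classifier stay trainable
for p in model_a.parameters():
    p.requires_grad = False

model = Ensemble(model_a, model_b, feat_dim=4, num_classes=3)
model(torch.randn(2, 10)).sum().backward()

# Which parameters received a gradient after backward()?
has_grad = {name: p.grad is not None for name, p in model.named_parameters()}

# state_dict() lists every parameter, frozen or not
state_keys = list(model.state_dict().keys())

print(has_grad)
print(state_keys)
```

The frozen `model_a` parameters should show no gradient, while `model_b` and `classifier` do, and all six entries still appear in the `state_dict()`. If you only want to optimize the trainable part, you could pass `filter(lambda p: p.requires_grad, model.parameters())` to the optimizer.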