Hi, thank you as always for your help.
When I load my trained model using load_state_dict, it raises the following error:
File "/root/.pyenv/versions/anaconda3-5.3.1/lib/python3.7/site-packages/torch/optim/optimizer.py", line 114, in load_state_dict
raise ValueError("loaded state dict contains a parameter group "
ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group
I was able to load trained models until yesterday, but have suddenly become unable to load them.
I am looking forward to hearing any suggestions for correcting this issue.
Hi tom,
Thank you for your kind reply.
As you suggested, I was using the wrong number of parameters.
I had mistakenly tried to load parameters for a different layer depth.
It now works after matching the model size and the parameter size.
Hi Tom,
I ran into an issue: I want to add an additional layer to the original model but still use a pretrained checkpoint. Is there a possible solution?
Yes, you can either modify the state dict or make load_state_dict less strict.
Personally, I tend to favor the former variant (having a translation function for keys and/or adding the model.state_dict() values for things not in the saved state dict) because it seems less likely that I'll forget things, but the latter would probably be faster.
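Here is a minimal sketch of both variants, with a made-up toy model and checkpoint just for illustration:

```python
import torch.nn as nn

# Stand-in for your pretrained checkpoint: state dict of the original model.
original = nn.Sequential(nn.Linear(10, 10), nn.ReLU())
pretrained = original.state_dict()

# New model with an additional layer appended.
model = nn.Sequential(nn.Linear(10, 10), nn.ReLU(), nn.Linear(10, 2))

# Variant 1: modify the state dict. Start from the new model's own
# state dict so any parameter not in the checkpoint keeps its fresh init.
merged = model.state_dict()
merged.update({k: v for k, v in pretrained.items() if k in merged})
model.load_state_dict(merged)

# Variant 2: make load_state_dict less strict. It returns the
# missing/unexpected keys so you can verify nothing surprising happened.
missing, unexpected = model.load_state_dict(pretrained, strict=False)
print("missing:", missing, "unexpected:", unexpected)
```

If your new layer changes the key names of existing parameters (e.g. module indices shift in an nn.Sequential), you would also need a small key-translation step in variant 1 before the update.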
in train
last_step = saver.restore(model_dir, map_location=self.device)
raise ValueError("loaded state dict contains a parameter group "
ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group
The problem you have there is likely that the number of parameters changed between saving the optimizer and restoring it. I'm afraid this is a tough problem, as the optimizer is completely unaware of which parameter is which (to the optimizer, the parameters are just an ordered list).
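A toy example (not your actual models) showing the mechanics: the optimizer's state dict refers to parameters only by position within each group, so restoring it into an optimizer built over a different number of parameters raises exactly this error.

```python
import torch
import torch.nn as nn

# Save the optimizer state for a small model.
model = nn.Linear(10, 2)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
saved = opt.state_dict()
print(saved["param_groups"][0]["params"])  # e.g. [0, 1] -- just positional indices

# Restore into an optimizer over a model with more parameters:
# the group sizes disagree, and load_state_dict raises the ValueError above.
bigger = nn.Sequential(nn.Linear(10, 10), nn.Linear(10, 2))
opt2 = torch.optim.SGD(bigger.parameters(), lr=0.1)
try:
    opt2.load_state_dict(saved)
except ValueError as e:
    print(e)
```

So the safest fix is to make sure the model has exactly the same set of parameters (in the same order) when the optimizer state is restored as it had when it was saved.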