Reset optimizer stats

What’s the easiest way to reset an optimizer’s stats, such as Adam’s moving averages, while keeping the same weights?

For example, suppose I have a model that I have pretrained on a dataset using Adam. Now I want to reset Adam’s stats and train the model on another dataset, while keeping the same parameters to be optimized. What’s the best way to reset the optimizer’s state, except for param_groups?

You could simply recreate the optimizer via:

optimizer = torch.optim.Adam(model.parameters(), lr=lr)

Would that work or do I misunderstand your question?

Unfortunately, I don’t have access to the parameters such as the learning rate. The only data I have are the model and the optimizer, which is why I would like to “reset” the optimizer itself. Currently I do the following:

from collections import defaultdict
self.optimizer.__setstate__({'state': defaultdict(dict)})  # clears the state, keeps param_groups

and then reuse the same optimizer. Could it be a solution?
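
For a bit more context, here is a minimal, self-contained sketch of that approach; the model and data below are just placeholders for illustration:

from collections import defaultdict

import torch
from torch import nn

model = nn.Linear(4, 1)                                   # placeholder model
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# one training step on the first dataset populates Adam's per-parameter state
loss = model(torch.randn(8, 4)).pow(2).mean()
loss.backward()
optimizer.step()
print(len(optimizer.state))                               # 2 (weight and bias)

# drop the per-parameter state; param_groups (lr, betas, ...) stay untouched
optimizer.__setstate__({'state': defaultdict(dict)})
print(len(optimizer.state))                               # 0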

If you have the optimizer instance, you should be able to get its attributes via:

optimizer.param_groups[0]['lr']

Your approach might work, but I would rather avoid manipulating the internal states.
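
For instance, assuming the model and optimizer from the snippets above are still in scope, you could read the hyperparameters back from the existing optimizer and build a fresh Adam instance; this is just a sketch of one way to do it:

import torch

group = optimizer.param_groups[0]        # Adam stores its hyperparameters per param group
lr, betas = group['lr'], group['betas']
eps, weight_decay = group['eps'], group['weight_decay']

# a fresh optimizer over the same parameters starts with an empty state
optimizer = torch.optim.Adam(model.parameters(), lr=lr, betas=betas,
                             eps=eps, weight_decay=weight_decay)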

Thanks for the answer. I also considered that approach, but I was confused by param_groups and defaults, so I decided to tweak the state directly. All in all, I would consider adding a reset method, so that the same optimizer can be reused. Would that make sense?
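
Something along these lines, perhaps; reset_optimizer_state is only a hypothetical name for illustration, not an existing PyTorch API:

from collections import defaultdict

def reset_optimizer_state(optimizer):
    # drop the per-parameter buffers (e.g. Adam's exp_avg / exp_avg_sq)
    # while leaving param_groups and defaults untouched
    optimizer.state = defaultdict(dict)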

That sounds like a good idea.
Would it just reset the internal states for some optimizers, or do you have anything else in mind?

Would you like to create this feature request on GitHub and explain your use case a bit?
Also, would you be interested in implementing this feature in case the proposal gets some positive feedback?

Yep, I’ll create the feature request, let’s see how it evolves :+1:

Is there a link to this feature request? I need exactly the same thing.

This should be the corresponding issue; could you please comment on it with your use case?