Why save optimizer state dict?

Sure, you’re just resetting learning rate “adaptations” from old training.

1 Like