Loading learning rate scheduler

Sarah_Lancaster · November 15, 2021, 12:47am

I’m confused about the order of instantiating and loading a learning rate scheduler object.

In the code below, I would like to resume training from a previous checkpoint. Setting last_epoch to that provided in the checkpoint ensures that the initial learning rates for the parameters are set correctly. However, calling load_state_dict after overwrites last_epoch (which would’ve been incremented by 1 in the instantiation due to the call to step), causing discrepancies in the resumption of the schedule.

Is this a bug or is there a proper way of doing the instantiation and restoration?

lr_scheduler=optim.lr_scheduler.CosineAnnealingLR(optimizer, opt.epochs,
                                                            last_epoch=checkpoint['lr_scheduler']['last_epoch'] if checkpoint else -1)

if checkpoint: lr_scheduler.load_state_dict(checkpoint['lr_scheduler'])