Change learning rate in PyTorch

Peter_Ham · March 9, 2018, 8:05am

I’m using Adam for optimization. Should I change the learning rate like this?

for param_group in optimizer.param_groups:
    param_group['lr'] = lr

It seems every time I change the learning rate, the loss increases a lot, and the accuracy goes down at the learning rate transition point. What’s the reason?

norm_inf · March 9, 2018, 8:35am

You could try to use lr_scheduler for that -> http://pytorch.org/docs/master/optim.html
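A minimal sketch of what that could look like with `torch.optim.lr_scheduler.ExponentialLR`, which multiplies the learning rate by `gamma` on every `scheduler.step()`. The toy model, the random data, and the value `gamma=0.9` are assumptions made only so the loop runs; they are not from the original question. In recent PyTorch releases `scheduler.step()` is expected after `optimizer.step()`.

```python
import torch
from torch import nn
from torch.optim.lr_scheduler import ExponentialLR

# Hypothetical toy model and data, only here to make the loop runnable.
model = nn.Linear(4, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# gamma is the per-epoch multiplicative factor; 0.9 is an arbitrary choice.
scheduler = ExponentialLR(optimizer, gamma=0.9)

x, y = torch.randn(16, 4), torch.randn(16, 1)
lrs = []
for epoch in range(3):
    optimizer.zero_grad()
    loss = nn.functional.mse_loss(model(x), y)
    loss.backward()
    optimizer.step()
    scheduler.step()  # decay the learning rate once per epoch
    lrs.append(optimizer.param_groups[0]["lr"])  # record the rate after decay
```

The scheduler writes to the same `param_group['lr']` entries as the manual loop in the question, so the two approaches should behave the same for a given decay rule.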

austin · March 12, 2018, 12:02am

That is the correct way to manually change a learning rate, and it’s fine to use it with Adam. As for why your loss increases when you change it, we can’t even guess without knowing how you’re changing the learning rate (increasing or decreasing), whether that’s the training or validation loss/accuracy, and details about the problem you’re solving. The reasons could be anything from “you’re choosing the wrong learning rate” to “your optimization jumped out of a local minimum”.
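As a rough illustration, the manual update can be wrapped in a small helper. The name `set_lr`, the toy model, and the learning rate values are hypothetical. The sketch also checks that overwriting `param_group['lr']` leaves Adam’s running first-moment estimates (`exp_avg` in `optimizer.state`) untouched, so only the step size changes.

```python
import torch
from torch import nn

def set_lr(optimizer, lr):
    """Set the same learning rate on every parameter group (hypothetical helper)."""
    for param_group in optimizer.param_groups:
        param_group["lr"] = lr

# Hypothetical toy model, only here to give the optimizer some parameters.
model = nn.Linear(4, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# Take one step so Adam creates its per-parameter state.
model(torch.randn(8, 4)).sum().backward()
optimizer.step()

# Snapshot the first-moment estimates, change the rate, then compare.
exp_avg_before = [s["exp_avg"].clone() for s in optimizer.state.values()]
set_lr(optimizer, 5e-4)
state_unchanged = all(
    torch.equal(before, s["exp_avg"])
    for before, s in zip(exp_avg_before, optimizer.state.values())
)
```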

If you’re interested, it’s probably best to build more intuition about what’s happening with the optimization on your own.

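Adam paper (Kingma & Ba, “Adam: A Method for Stochastic Optimization”)

SGDR paper (Loshchilov & Hutter, “SGDR: Stochastic Gradient Descent with Warm Restarts”)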

Peter_Ham · March 13, 2018, 6:11pm

Thanks. I’m actually decreasing the learning rate by multiplying it by 0.99 every epoch.

SimonW · March 13, 2018, 6:24pm

\sum_i 0.99^i is a convergent sum. You should consider a schedule where the learning rate converges to 0 but the sum of the learning rates diverges.
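To make the distinction concrete, here is a small numeric sketch comparing partial sums of the 0.99^i decay with a 1/(i+1) decay. The 1/(i+1) schedule is only an assumed example of a sequence that goes to 0 while its sum diverges, and the base learning rate is left out since it only scales both sums.

```python
# Partial sums of two step-size schedules (base learning rate omitted).
def partial_sum(schedule, n_terms):
    """Sum schedule(i) for i = 0 .. n_terms - 1."""
    return sum(schedule(i) for i in range(n_terms))

def geometric(i):
    return 0.99 ** i        # the 0.99-per-epoch decay from this thread

def harmonic(i):
    return 1.0 / (i + 1)    # assumed example: goes to 0, but its sum diverges

for n in (100, 10_000, 100_000):
    print(n, partial_sum(geometric, n), partial_sum(harmonic, n))
```

The geometric partial sums level off near 1 / (1 - 0.99) = 100, while the harmonic ones keep growing roughly like ln(n).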

Peter_Ham · March 16, 2018, 7:40pm

I don’t understand this. Why should the sum diverge?

SimonW · March 17, 2018, 5:37am

Never mind, I was somehow thinking about convex problems. My bad.