Effect of calling model.cuda() after constructing an optimizer

SimonW (Simon Wang) March 21, 2018, 1:43am 7

Can you try adagrad? Adam doesn’t initialize such buffers: https://github.com/pytorch/pytorch/blob/master/torch/optim/adam.py

2 Likes