Using optimizers with torch.autograd.grad()?

My team is trying to use torch.autograd.grad() with the torch optimizers, but the optimizers only consume each parameter's .grad attribute, not an arbitrary list of gradients. To work around this, we manually assign the gradients returned by grad() to the parameters' .grad attributes before calling step().
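
A minimal sketch of the workaround we use today (the model, loss, and names below are just illustrative):

```python
import torch

model = torch.nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

x = torch.randn(32, 10)
loss = model(x).pow(2).mean()

# Compute gradients explicitly instead of calling loss.backward().
params = list(model.parameters())
grads = torch.autograd.grad(loss, params)  # tuple of tensors, one per parameter

# Manually populate .grad so the optimizer can consume the gradients.
optimizer.zero_grad()
for p, g in zip(params, grads):
    p.grad = g
optimizer.step()
```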

Is there a better pattern?

I think the optimizers should take an optional "grads" argument, either in step() or at construction time, to facilitate this. Worth a discussion / RFC on https://github.com/pytorch/pytorch/issues.
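
Something along these lines (purely hypothetical, this argument does not exist in the current API):

```python
# Hypothetical usage if step() accepted precomputed gradients directly.
grads = torch.autograd.grad(loss, model.parameters())
optimizer.step(grads=grads)  # "grads" is the proposed argument, not a real one
```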