It seems like there is a difference in results when gradients are calculated for a LongTensor:
```python
import torch

print(torch.__version__)

x = torch.tensor([1, 2, 2], requires_grad=True)  # integer entries -> LongTensor
l = torch.norm(x.float())
g = torch.autograd.grad(l, x)
print(g)
```
This prints `(tensor([0, 0, 0]),)`, which is wrong.
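For reference, a quick dtype check on my end (same setup as above; this assumes a PyTorch version that accepts `requires_grad=True` on integer tensors, as mine apparently does) confirms that `x` really is a LongTensor:

```python
import torch

# Same construction as above; on my install this is accepted,
# even though x ends up with an integer dtype.
x = torch.tensor([1, 2, 2], requires_grad=True)
print(x.dtype)  # torch.int64, i.e. a LongTensor
```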
However, if I change the code slightly to
```python
x = torch.tensor([1.0, 2, 2], requires_grad=True)  # note 1.0 instead of 1
l = torch.norm(x.float())
g = torch.autograd.grad(l, x)
print(g)
```
it prints `(tensor([0.3333, 0.6667, 0.6667]),)`, which is correct.
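This matches the analytic gradient of the L2 norm, x / ||x||: for x = [1, 2, 2] the norm is sqrt(1 + 4 + 4) = 3, giving [1/3, 2/3, 2/3]. A minimal manual check, independent of autograd:

```python
import torch

x = torch.tensor([1.0, 2.0, 2.0])
norm = torch.norm(x)  # sqrt(1 + 4 + 4) = 3.0
print(x / norm)       # tensor([0.3333, 0.6667, 0.6667])
```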
Any idea what might be happening?