Backward error with einsum

I’m receiving the error “one of the variables needed for gradient computation has been modified by an inplace operation” when I run the code below. To get rid of the error, I have to either set ‘iterations’ to 1 or write x.data = x.data + z[0,:,:] instead of x = x + z[0,:,:]. Is there a way to avoid the error without resorting to either of those workarounds? I ran this code with PyTorch 0.4.0 on Windows 10. On Red Hat, curiously, the code hangs instead. @tom - any ideas please? Thank you.

import torch

iterations = 2
m, q, p, n = 2, 3, 4, 3
x = torch.randn(m, q)
y = torch.randn(p, n).requires_grad_()
for i in range(iterations):
    z = torch.einsum('mq,pn->pmn', (x, y))
    # x.data = x.data + z[0, :, :]  # WORKS FINE
    x = x + z[0, :, :]              # THROWS ERROR
err = torch.sum(z)
err.backward()
print(y.grad)

Unfortunately, there is a bug with einsum in 0.4 that has only been fixed in master. You could check whether, as in the original reporter’s case, passing x.clone() to the einsum call is a feasible workaround.
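
Applied to your snippet, that would look roughly like this (a sketch, untested on 0.4; the clone only shields x from the buggy in-place modification):

import torch

iterations = 2
m, q, p, n = 2, 3, 4, 3
x = torch.randn(m, q)
y = torch.randn(p, n).requires_grad_()
for i in range(iterations):
    # pass a clone so the buggy einsum in 0.4 cannot modify x in place
    z = torch.einsum('mq,pn->pmn', (x.clone(), y))
    x = x + z[0, :, :]
err = torch.sum(z)
err.backward()
print(y.grad)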

Best regards

Thomas

There really is a bug in einsum in PyTorch 0.4 (I used unsqueeze_ where I should have used unsqueeze), so it modifies its arguments in place under some circumstances. Note that x = x + ... is not an in-place modification and is perfectly legitimate. (The .data workaround is bogus, but I think Santosh is aware of that, too.)
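
To illustrate the distinction autograd cares about (this is separate from the einsum bug, just a minimal sketch of the in-place rule):

import torch

y = torch.randn(3, requires_grad=True)
z = torch.exp(y)    # exp saves its output z for the backward pass
out = z.sum()
z = z + 1           # out of place: only rebinds the name z, the saved tensor is untouched
out.backward()      # works fine

y2 = torch.randn(3, requires_grad=True)
z2 = torch.exp(y2)
out2 = z2.sum()
z2.add_(1)          # in place: modifies the tensor that exp saved for backward
out2.backward()     # RuntimeError: one of the variables needed for gradient
                    # computation has been modified by an inplace operation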

Best regards

Thomas

Okay, apologies for sharing the wrong information.
Thanks for the clarification, Tom.

Thank you @tom for the clarification. I ended up rewriting the einsum calculations using bmm.
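
For anyone curious, the particular einsum in the snippet above, 'mq,pn->pmn', sums x over q and then forms an outer product with y, so one way to express it with bmm is sketched below (this is just an illustration, not necessarily the rewrite used here):

import torch

m, q, p, n = 2, 3, 4, 3
x = torch.randn(m, q)
y = torch.randn(p, n, requires_grad=True)

z_einsum = torch.einsum('mq,pn->pmn', x, y)

s = x.sum(dim=1)                                                 # shape (m,): q is summed out
z_bmm = torch.bmm(s.view(1, m, 1).expand(p, m, 1).contiguous(),  # (p, m, 1)
                  y.view(p, 1, n))                               # (p, 1, n) -> (p, m, n)

print(torch.allclose(z_einsum, z_bmm))                           # True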