Strange problem when manually performing gradient descent

If I code:

with torch.no_grad():
    w -= 0.01 * w.grad
    w.grad.zero_()

Everything works well with no error.
However, if I code:

with torch.no_grad():
    w = w - 0.01 * w.grad
    w.grad.zero_()

I get the following error:

Traceback (most recent call last):
  File "/home/jihao/deep_learning/auto_gradient.py", line 34, in <module>
    w.grad.zero_()
AttributeError: 'NoneType' object has no attribute 'zero_'

What’s the difference between the two snippets of code?

w -= 0.01 * w.grad is an in-place operation: it modifies the existing tensor w directly, so w stays the same leaf tensor and keeps its .grad attribute.

However, w = w - 0.01 * w.grad is not an in-place operation: it creates a new tensor and rebinds the name w to it. Because this new tensor is created inside torch.no_grad(), it does not have requires_grad set and its .grad is None, hence the error.

You can quickly check this by calling print(w.requires_grad) after the update. In the first case you would get True, whereas in the second case it would be False.
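
To see this concretely, here is a minimal sketch (toy values, not from the original script) that runs both update styles and prints the relevant flags:

import torch

# In-place update: w remains the same leaf tensor.
w = torch.tensor([1.0, 2.0], requires_grad=True)
loss = (w ** 2).sum()
loss.backward()

with torch.no_grad():
    w -= 0.01 * w.grad      # modifies w in place
print(w.requires_grad)       # True
print(w.grad is None)        # False, so w.grad.zero_() works

# Out-of-place update: the name w2 is rebound to a brand-new tensor.
w2 = torch.tensor([1.0, 2.0], requires_grad=True)
loss2 = (w2 ** 2).sum()
loss2.backward()

with torch.no_grad():
    w2 = w2 - 0.01 * w2.grad  # new tensor created under no_grad
print(w2.requires_grad)       # False
print(w2.grad)                # None, hence the AttributeError

If you prefer the out-of-place expression, one option is to keep the same leaf tensor by writing w.copy_(w - 0.01 * w.grad) inside the no_grad block; otherwise simply stick with the in-place -= update.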

I understand. Thank you so much!

Could you please mark the thread as solved for the benefit of other members?

Already marked. Thank you very much for solving my first question here.

Ah great. Thanks 🙂