Parameter vs tensor.requires_grad = True

Hello everyone,
I have a question regarding the autograd system: why is there the following difference between these two pictures?

this one:

[screenshot: w created as an nn.Parameter]

VS

this one:

[screenshot: w created with requires_grad_() and then reassigned through another operation]


An nn.Parameter is a container meant to create learnable tensors inside an nn.Module.
That's why it keeps tracking the gradient, as any other parameter of an nn.Module would. Besides,
you are instantiating the Parameter class with the result of the multiplication (rand*0.3) as its argument, whereas in the other case the variable that has requires_grad is later submitted to another operation, so the resulting variable is not a leaf variable. Namely, w = 3*rand(1) does not point to the same object as w = rand(1): in the former case, w points to the result of 3*rand(1), which is a different tensor than rand(1).

import torch

w = torch.rand(5).requires_grad_()   # created directly: a leaf tensor
q = 3 * w                            # result of an operation on w: not a leaf

w.is_leaf
Out[8]: True
q.is_leaf
Out[9]: False
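
To see how this plays out for gradients, here is a minimal sketch (the tensors are illustrative, not the ones from the screenshots): after backward(), the leaf tensors, including the Parameter, accumulate a .grad, while the non-leaf result of 3 * w does not.

import torch
import torch.nn as nn

# Leaf tensors (plain leaves and Parameters) accumulate .grad;
# a tensor produced by an operation does not, by default.
p = nn.Parameter(torch.rand(1))       # leaf, requires_grad=True by default
w = torch.rand(1).requires_grad_()    # leaf
q = 3 * w                             # non-leaf: result of an op on w

(p.sum() + q.sum()).backward()

print(p.is_leaf, p.grad)              # True  tensor([1.])
print(w.is_leaf, w.grad)              # True  tensor([3.])
print(q.is_leaf, q.grad)              # False None (PyTorch warns about .grad on a non-leaf)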

Thanks a lot, you are right!

Does this mean that we have to use nn.Parameter (rather than just a tensor with requires_grad_) when building a custom network module/layer?

Yes.
That's the difference between a buffer (a tensor that is static and not meant to be changed by backprop) and a parameter.

The latter is also what gets returned when you call model.parameters().
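
For example, a sketch with a hypothetical ScaleShift module: the nn.Parameter shows up in model.parameters() (so the optimizer will update it), while the tensor registered with register_buffer is saved with the module's state_dict but never trained.

import torch
import torch.nn as nn

# Hypothetical toy module, just to illustrate the distinction.
class ScaleShift(nn.Module):
    def __init__(self):
        super().__init__()
        # Learnable: registered as a parameter, returned by model.parameters()
        # and updated by the optimizer.
        self.weight = nn.Parameter(torch.ones(1))
        # Static: registered as a buffer, saved in the state_dict but not trained.
        self.register_buffer("shift", torch.zeros(1))

    def forward(self, x):
        return self.weight * x + self.shift

m = ScaleShift()
print([name for name, _ in m.named_parameters()])  # ['weight']
print([name for name, _ in m.named_buffers()])     # ['shift']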
