How to modify the gradient manually?

Brasnold · September 16, 2017, 8:09pm

I need to have access to the gradient before the weights are updated.
In particular I need to modify it by multiplying it for another function.

 loss.backward()
 # Here i need to access the gradients and modified it.
 optimizer.step()

jkarimi91 · September 16, 2017, 8:36pm

Brasnold · September 16, 2017, 10:17pm

Thanks, but the response is not conclusive. I need to know how to implement somenthing like this in tensorflow.

grads_and_vars = opt.compute_gradients(loss, parameter_list)
my_grads_and_vars = [(g*C, v) for g, v in grads_and_vars]
opt.apply_gradients(my_grads_and_vars)

If i’ve understand it well, torch.backward() store the grandient in parameter.grad, so i’ve to access every parameter iterating long the module parameters list.

jkarimi91 · September 16, 2017, 10:34pm

loss.backward()
for p in model.parameters():
    p.grad *= C  # or whatever other operation
optimizer.step()

Brasnold · September 16, 2017, 10:49pm

Thanks.

And if i’ve two loss function and different operations to do?

For example, i’ve f_1 and f_2, and I want the learning signal (or the gradient to apply) to be D(f_1) + D(f_2) * C, where D(f_1) stand for derivative.

It should be:

loss2.backward()
for p in model.parameters():
p.grad *= C
loss1.backward()
optimizer.step()

This works because i compute the gradients in f_2 before than f_1, otherwise grad accumulate gradients and i can’t acces only f_2 part. There’s a way to do it regardless of order? So is there a way to access the gradient before it is accumulated in p.grad?

11193 · August 30, 2018, 2:20pm

how about this:

loss2.backward(retain_graph=True)
for p in model.parameters():
    p.grad *= C
optimizer.step()
optimizer.zero_grad()
loss1.backward()
optimizer.step()

so you can avoid accumulate gradients on parameters.

rohun · February 27, 2019, 3:41pm

I see that this allows us to update the weights twice.
But what I want to weight the gradient from loss2 with C2 and from loss1 with C1 and make a single update?

huahuanZ · March 1, 2021, 4:47pm

How about this

loss1.backward(torch.ones_like(loss1)*C1)
loss2.backward(torch.ones_like(loss2)*C2)
optimizer.step()