How can I update only part of a weight during backpropagation?

Shixian_Wen · March 20, 2018, 7:58pm

For example, one of the fully connected layers in my network has a weight of shape [100, 300], which maps an input of size [batch_size, 100] to an output of size [batch_size, 300].
How can I allow only the first 30 neurons to be updated and freeze the remaining 70, without setting the gradient to 0? Zeroing the gradient is not what I want, because then the gradients cannot backpropagate to the previous layer. The choice of which neurons to freeze is made at runtime, so it cannot be fixed beforehand.

What I tried was: weight[30:100,:].requires_grad = False.
However, I got the following error:
*** RuntimeError: you can only change requires_grad flags of leaf variables. If you want to use a computed variable in a subgraph that doesn’t require differentiation use var_no_grad = var.detach().

jpeg729 · March 20, 2018, 9:05pm

You can’t set requires_grad=False on just part of a Variable.

I would suggest zeroing the relevant gradients manually after calling loss.backward(). That won’t affect the gradients passed to the earlier layers, because the backward pass will already have completed by then.
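
Here is a minimal sketch of that idea, assuming a bare parameter of shape [100, 300] applied as x @ weight to match the shapes in the question. The names x, weight, mask and the choice of plain SGD are only for illustration and are not from the original post; x stands in for the output of the previous layer.

```python
import torch

torch.manual_seed(0)

batch_size = 4
# Stand-in for the previous layer's output (illustrative, not from the question).
x = torch.randn(batch_size, 100, requires_grad=True)
# Weight of shape [100, 300], as described in the question.
weight = torch.nn.Parameter(torch.randn(100, 300) * 0.01)
# Plain SGD without momentum or weight decay (an assumption of this sketch).
optimizer = torch.optim.SGD([weight], lr=0.1)

# Mask chosen at runtime: 1 for rows that may be updated, 0 for frozen rows.
mask = torch.zeros(100, 1)
mask[:30] = 1.0

before = weight.detach().clone()

out = x @ weight              # [batch_size, 300]
loss = out.pow(2).sum()

optimizer.zero_grad()
loss.backward()               # x.grad is computed here, using all 100 rows

# The backward pass is finished, so masking weight.grad cannot change x.grad.
weight.grad.mul_(mask)        # zero the gradient of rows 30..99
optimizer.step()

# Per-row flag: did this row of the weight change during the step?
changed = (weight.detach() - before).abs().sum(dim=1) > 0
```

After the step, changed should be True only for the first 30 rows, while x.grad still carries gradient for all 100 inputs. One caveat: with an optimizer that uses weight decay or momentum, the frozen rows can still move even when their gradient is zero, so this sketch is only reliable as-is with a plain optimizer.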

Shixian_Wen · March 21, 2018, 1:38am

Thank you very much! This is a really good solution!