Torch.no_grad and register_hook inside forward

Hi, I want to set part of an intermediate layer's activation to zero during the forward pass.
Suppose my forward function looks like this:

def forward(self, x):
    x = self.l1(x)
    x = F.relu(x)
    x[:10, :] = 0  # zero out the activations of the first 10 rows

    def x_hook(grad):
        # zero out the matching entries of the incoming gradient
        grad_clone = grad.clone()
        grad_clone[:10, :] = 0
        return grad_clone
    x.register_hook(x_hook)

    output = F.log_softmax(x, dim=1)
    return output

This runs fine inside my train function. However, inside my test function

def test(model, device, test_loader):
    model.eval()
    ...
    with torch.no_grad():
        for data, target in test_loader:
            ...
            output = model(data)
            ...

when running the code in the test function, it gives this error:
cannot register a hook on a tensor that doesn’t require gradient
My questions are:
(1) Can we call register_hook on a torch.Tensor inside the forward function?
(2) Why does register_hook, when called inside torch.no_grad(), still try to deal with gradients here?

Any suggestion would be appreciated!

Hi,

To answer your original question: you want to add the hook only if x.requires_grad == True.
Inside torch.no_grad() no gradients are computed, so to avoid the user registering a hook that would never be called, we raise an error.
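
For example, a minimal sketch of your forward with that guard added (same layer names and imports as in your snippet); under torch.no_grad() the hook registration is simply skipped:

def forward(self, x):
    x = self.l1(x)
    x = F.relu(x)
    x[:10, :] = 0

    # Only attach the hook when autograd is actually tracking x
    # (e.g. during training); inside torch.no_grad() this branch is skipped.
    if x.requires_grad:
        def x_hook(grad):
            grad_clone = grad.clone()
            grad_clone[:10, :] = 0
            return grad_clone
        x.register_hook(x_hook)

    output = F.log_softmax(x, dim=1)
    return output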

Note that since you override the values of x in a differentiable manner, the gradients for these entries will be zeroed out by the backward of x[:10,:] = 0. So you don’t actually need the hook here!
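
Just to illustrate that mechanism in isolation, here is a toy check (using clone() as a stand-in for the layer output, not your actual model) showing that the gradient of the overwritten rows is already zero without any hook:

import torch

x = torch.randn(20, 5, requires_grad=True)
y = x.clone()      # stand-in for the activations produced by the layer
y[:10, :] = 0      # the in-place overwrite is recorded by autograd
y.sum().backward()

print(x.grad[:10].abs().sum())  # tensor(0.)  -> overwritten rows get zero gradient
print(x.grad[10:].unique())     # tensor([1.]) -> untouched rows get the usual gradient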


This also solved my problem. Thank you!