I am required to compute my loss function as below:
where “Theta” denotes the parameters of the network (i.e. the weights), “f(Theta)” is the network output, “y” is the true label, and “x” is the input sample.
Would you please give me a tip on how to do that, ideally with sample code?
How can I compute this in PyTorch during the training of my network?
You can find in this gist a function that shows how to compute the Hessian.
But you can compute the dot product with f - y much more efficiently using the Rop function from this gist. In particular, the call would be
Rop(loss, theta, f(theta) - y).
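If you don't want to depend on the gist, the same Hessian-vector product can be sketched with plain double backprop. This is a minimal sketch, assuming the goal is H·v = d/dtheta (grad(loss)·v); the function name hvp and the toy quadratic loss are just for illustration:

```python
import torch

def hvp(loss, params, vec):
    # First backward pass: gradient of the loss w.r.t. the parameters,
    # with create_graph=True so we can differentiate through it again.
    grads = torch.autograd.grad(loss, params, create_graph=True)
    # Scalar dot product grad(loss) . v.
    dot = sum((g * v).sum() for g, v in zip(grads, vec))
    # Second backward pass: the gradient of that scalar is H v,
    # without ever materializing the full Hessian.
    return torch.autograd.grad(dot, params)

# Toy usage with a known Hessian: loss = sum(theta^2), so H = 2 * I.
theta = torch.randn(3, requires_grad=True)
loss = (theta ** 2).sum()
v = torch.ones(3)
(hv,) = hvp(loss, [theta], [v])  # equals 2 * v for this loss
```

In your case, vec would play the role of f(theta) - y (detached from the graph).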
Regarding the first gist, do you mean to use the following function,
where “y” is “loss” and “x” is “theta” in my example? Is that right?
import torch

def jacobian(y, x, create_graph=False):
    jac = []
    flat_y = y.reshape(-1)
    grad_y = torch.zeros_like(flat_y)
    for i in range(len(flat_y)):
        # One-hot grad_output selects the gradient of the i-th output component.
        grad_y[i] = 1.
        grad_x, = torch.autograd.grad(flat_y, x, grad_y, retain_graph=True, create_graph=create_graph)
        jac.append(grad_x.reshape(x.shape))
        grad_y[i] = 0.
    return torch.stack(jac).reshape(y.shape + x.shape)
Yes, you want the gradient of the loss with respect to theta.