For one of my tasks, suppose we have a module M parameterized by $\theta$.
Here $\theta$ denotes a collection of parameters within M, such as the parameters of a linear layer.
Is there a way to calculate:
$$H_{\theta}$$
i.e., the Hessian matrix w.r.t. $\theta$? Any suggestions or references are welcome.
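
To make the question concrete, here is a minimal sketch of the quantity I mean. It assumes the Hessian is taken of a scalar loss with respect to the flattened parameters, and it uses plain Python with central finite differences on a toy linear layer with one weight and one bias. The names `toy_loss` and `numerical_hessian` and the tiny dataset are made up for illustration and do not come from any library.

```python
from typing import Callable, List


def toy_loss(theta: List[float]) -> float:
    """Squared error of a 1-input linear layer y = w * x + b on a tiny dataset.

    theta = [w, b]; the data points below are made up for illustration.
    """
    w, b = theta
    data = [(1.0, 1.0), (2.0, 3.0)]  # (x, y) pairs
    return sum((w * x + b - y) ** 2 for x, y in data)


def numerical_hessian(
    f: Callable[[List[float]], float],
    theta: List[float],
    eps: float = 1e-3,
) -> List[List[float]]:
    """Approximate H[i][j] = d^2 f / (d theta_i d theta_j) by central differences."""
    n = len(theta)
    hess = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            def shifted(di: float, dj: float) -> float:
                # Evaluate f with theta_i shifted by di and theta_j shifted by dj.
                t = list(theta)
                t[i] += di
                t[j] += dj
                return f(t)

            value = (
                shifted(eps, eps)
                - shifted(eps, -eps)
                - shifted(-eps, eps)
                + shifted(-eps, -eps)
            ) / (4.0 * eps * eps)
            # The Hessian of a smooth scalar function is symmetric.
            hess[i][j] = value
            hess[j][i] = value
    return hess


# For this quadratic loss the analytic Hessian is [[10, 6], [6, 4]]
# (2 * sum(x^2), 2 * sum(x), 2 * N), independent of theta.
H = numerical_hessian(toy_loss, [0.5, -0.2])
```

This needs on the order of n^2 loss evaluations for n parameters, so it is only useful as a sanity check on a handful of parameters, not for a real module.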
Thanks very much ~