How to optimize the term with "gradient wrt input"?

phantom90 · May 19, 2021, 4:09pm

Hi there,

I've run into the following problem: how do I compute

in PyTorch? Here, X is the model input, Y is the prediction, and \theta is the model parameters.

Thanks in advance~
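
For reference, here is a minimal sketch of one common way to optimize a term that contains the gradient with respect to the input. The exact expression is not shown above, so this assumes the term is the squared norm of dY/dX, minimized over \theta; the toy model, the data, and the helper name `input_gradient_penalty` are all hypothetical. The key piece is `torch.autograd.grad(..., create_graph=True)`, which keeps the graph of the first gradient so that it can be differentiated again with respect to the parameters.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical toy model; any differentiable nn.Module would do here.
model = nn.Sequential(nn.Linear(4, 8), nn.Tanh(), nn.Linear(8, 1))
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

# The input must require gradients so that dY/dX can be computed.
X = torch.randn(16, 4, requires_grad=True)


def input_gradient_penalty(model, X):
    """Assumed objective: mean over the batch of ||dY/dX||^2."""
    Y = model(X)
    # Samples are independent here, so the gradient of Y.sum() w.r.t. X
    # gives dY_i/dX_i for every row i.
    # create_graph=True makes the result differentiable w.r.t. theta.
    (dY_dX,) = torch.autograd.grad(Y.sum(), X, create_graph=True)
    return dY_dX.pow(2).sum(dim=1).mean()


losses = []
for _ in range(20):
    optimizer.zero_grad()
    loss = input_gradient_penalty(model, X)
    loss.backward()  # second-order backward pass through dY/dX
    optimizer.step()
    losses.append(loss.item())
```

If the actual term is different (for example, one that is added to a regular task loss), the same pattern should still apply: compute the input gradient with `create_graph=True`, build the term from it, and call `backward()` on the total loss.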