I want to implement the following algorithm, taken from section 13.6 of this book.
PyTorch requires me to compute a loss for both w and theta.
I am struggling to come up with that loss term, because it requires adding a factor that depends on the current weights to the derivative, which is something I don't really know how to do.
I would appreciate a small code snippet showing how something like this is done in PyTorch.
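For illustration, here is a minimal sketch of the kind of update I mean. It assumes the algorithm is a one-step actor-critic style rule of the form w += alpha_w * delta * grad v(s, w) and theta += alpha_theta * delta * grad log pi(a|s, theta), where delta is the TD error; the exact rule in the book may differ (for example, it may include eligibility traces or extra discount factors). The linear `critic` and `actor`, the step sizes, and the function name `scaled_gradient_step` are all hypothetical and only there to make the example runnable.

```python
import torch

torch.manual_seed(0)

# Hypothetical tiny models: a linear critic v(s, w) and a softmax actor pi(a|s, theta).
n_features, n_actions = 4, 2
critic = torch.nn.Linear(n_features, 1)
actor = torch.nn.Linear(n_features, n_actions)

# Assumed step sizes and discount factor, chosen only for illustration.
alpha_w, alpha_theta, gamma = 0.1, 0.01, 0.99


def scaled_gradient_step(state, action, reward, next_state, done):
    """Apply one manual update where each gradient is scaled by the TD error."""
    # The scaling factor depends on the current weights, but it is treated
    # as a constant: no gradient flows through it.
    with torch.no_grad():
        v_next = torch.zeros(()) if done else critic(next_state).squeeze()
        delta = reward + gamma * v_next - critic(state).squeeze()

    # Quantities whose plain gradients we need.
    v = critic(state).squeeze()
    log_prob = torch.log_softmax(actor(state), dim=-1)[action]

    # Plain gradients with respect to w (critic) and theta (actor).
    grads_w = torch.autograd.grad(v, list(critic.parameters()))
    grads_theta = torch.autograd.grad(log_prob, list(actor.parameters()))

    # Manual ascent step: parameter += step_size * delta * gradient.
    with torch.no_grad():
        for p, g in zip(critic.parameters(), grads_w):
            p.add_(alpha_w * delta * g)
        for p, g in zip(actor.parameters(), grads_theta):
            p.add_(alpha_theta * delta * g)
    return delta.item()


# Usage: one terminal transition with a made-up state and reward.
s = torch.ones(n_features)
td_error = scaled_gradient_step(s, action=0, reward=1.0, next_state=s, done=True)
```

Under the same assumptions, the critic part could probably also be written as a loss for plain `torch.optim.SGD`, namely `loss = -(delta.detach() * v)`, since its gradient is the ordinary gradient of `v` scaled by the constant `delta`.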
I have also posted this question on Stack Overflow, with clearer formulas.