Computing output gradients for NTK

ptrblck · April 12, 2021, 5:16am

You could either reduce the ouputs to a scalar value and call .backward() on it or you could alternatively pass the gradients to the backward operation.
@albanD explains the reasoning behind this in this post.