Loss.backward() throws error because output tensor has no gradient

ptrblck · August 2, 2024, 11:24am

Good catch! Indeed, replacing the tensor with constants won’t work. The threshold is not usefully differentiable: its gradient would be zero almost everywhere and undefined or Inf at the rounding points. This post might be useful.
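Here is a minimal sketch of that behavior, in case it helps. The tensor values, the 0.5 threshold, and the steepness of 10.0 are made up for illustration, and the sigmoid surrogate at the end is just one common workaround I'm assuming here, not necessarily what the linked post recommends.

```python
import torch

# Illustrative input; values are arbitrary.
x = torch.tensor([0.2, 0.7, 1.4], requires_grad=True)

# A hard comparison yields a tensor that is cut off from the autograd graph,
# so calling backward() on anything built only from it raises a RuntimeError.
hard = (x > 0.5).float()
print(hard.requires_grad)

try:
    hard.sum().backward()
    backward_failed = False
except RuntimeError as err:
    backward_failed = True
    print("backward failed:", err)

# torch.round stays in the graph, but its gradient is zero everywhere,
# so no useful signal reaches x.
torch.round(x).sum().backward()
round_grad = x.grad.clone()
print(round_grad)

# Assumed workaround: a steep sigmoid as a smooth stand-in for the threshold.
x.grad = None
soft = torch.sigmoid(10.0 * (x - 0.5))
soft.sum().backward()
soft_grad = x.grad.clone()
print(soft_grad)
```

The first case reproduces the "does not require grad" error from the title, the second runs but trains nothing, and only the smooth version passes a non-zero gradient back to `x`.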