Iterative Loss: Loss from first iteration when added in the second iteration does not work

ptrblck · April 23, 2021, 7:24pm

I guess you are seeing an error, since you are trying to calculate the gradients (in iter2) from stale forward activations (calculated in iter1).
Have a look at this post for more information and check, if you are hitting the same error.