One of the variables needed for gradient computation has been modified by an inplace operation error occured

ptrblck · July 18, 2021, 4:53am

You could be hitting this issue which would be raised in case a backward pass tries to compute gradients with already updated parameters and thus also stale forward activations.