Please post your training loop. The growing memory is likely because the gradients are never cleared anywhere: there is no call to zero_grad().
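For reference, here is a minimal sketch of where zero_grad() usually sits in a PyTorch training loop. The model, data, and names (`model`, `probe`, `train`, `inputs`, `targets`) are made up for illustration and are not taken from your code. The second part only shows that backward() adds to whatever is already stored in .grad when nothing clears it.

```python
import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical toy model and data, only for illustration.
model = nn.Linear(4, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.MSELoss()
inputs = torch.randn(8, 4)
targets = torch.randn(8, 1)


def train(num_steps):
    """Run a few optimization steps, clearing gradients at the start of each."""
    losses = []
    for _ in range(num_steps):
        optimizer.zero_grad()            # clear gradients from the previous step
        loss = loss_fn(model(inputs), targets)
        loss.backward()                  # populate .grad for this step only
        optimizer.step()
        losses.append(loss.item())       # record the scalar value as a Python float
    return losses


losses = train(5)

# Without zero_grad(), backward() accumulates into the existing .grad.
probe = nn.Linear(4, 1)                  # separate layer, weights never updated here
loss_fn(probe(inputs), targets).backward()
single = probe.weight.grad.clone()       # gradient after one backward pass
loss_fn(probe(inputs), targets).backward()   # no zero_grad() in between
accumulated = probe.weight.grad.clone()  # sum of two identical gradients

print(torch.allclose(accumulated, 2 * single))
```

If your loop already calls zero_grad() once per iteration like this, the cause is somewhere else, which is why seeing the actual loop would help.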