Thank you so much @ptrblck for your explanations, I end-up plotting the training losses both ways and the losses calculated in model.train()
is indeed follow the losses calculated in model.eval()
.
Also, thanks for pointing out the issues in my code!