CUDA out of memory during training

Bohao_Cheung · December 26, 2024, 2:37am

A good suggestion that use with torch.no_grad() in test and validation phase(clear immediate tensors) and detach the loss(remove cached calculating phase) while calculating the total loss!!!