torch.cuda.OutOfMemoryError when training Mask R-CNN

ptrblck · July 28, 2023, 8:20pm

The model parameters and the input might be tiny compared to the intermediate activations, which are stored during the forward pass because they are needed for the gradient computation.
Have a look at this post showing an example.
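
To illustrate the point about stored intermediates, here is a back-of-the-envelope estimate for a hypothetical toy stack of 3x3 convolutions (not Mask R-CNN itself). The channel sizes, batch size, and image resolution are made-up assumptions. It also assumes float32 everywhere, stride 1 with "same" padding, and that roughly one output-sized tensor per layer is kept for the backward pass:

```python
# Rough memory estimate for a hypothetical toy conv stack (pure arithmetic, no GPU needed).
BYTES_PER_FLOAT32 = 4
MIB = 1024 ** 2


def conv_param_count(c_in, c_out, kernel_size):
    """Number of weights plus biases in a kernel_size x kernel_size conv layer."""
    return c_out * c_in * kernel_size * kernel_size + c_out


def conv_output_count(batch, c_out, height, width):
    """Number of output elements, assuming stride 1 and 'same' padding."""
    return batch * c_out * height * width


# Assumed values, chosen only for illustration.
batch, height, width = 2, 800, 800
channels = [3, 64, 64, 64, 64]  # input channels followed by 4 conv layers

input_bytes = batch * channels[0] * height * width * BYTES_PER_FLOAT32

param_bytes = 0
activation_bytes = 0
for c_in, c_out in zip(channels[:-1], channels[1:]):
    param_bytes += conv_param_count(c_in, c_out, 3) * BYTES_PER_FLOAT32
    # Assumption: one output-sized tensor per layer is stored for backward.
    activation_bytes += conv_output_count(batch, c_out, height, width) * BYTES_PER_FLOAT32

print(f"parameters:  {param_bytes / MIB:10.2f} MiB")
print(f"input:       {input_bytes / MIB:10.2f} MiB")
print(f"activations: {activation_bytes / MIB:10.2f} MiB")
```

With these made-up numbers the parameters take less than 1 MB and the input about 15 MB, while the stored layer outputs add up to more than 1 GB. A real detection model typically keeps additional intermediates, so treat this only as an order-of-magnitude sketch.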