RuntimeError: CUDA out of memory. Tried to allocate 3.62 GiB (GPU 0; 47.99 GiB total capacity; 24.39 GiB already allocated; 20.41 GiB free; 24.71 GiB reserved in total by PyTorch)
My free memory (20.41GiB) is LARGER than the memory that it tries to allocate (3.62GiB).
Actually it’s 16.79GiB larger than what it needs.
I tried to find similar case and found one,
In the post they say “it was a number of workers problems, solved it by lowering them”
But I don’t think it applies to my case.
Thanks for reply JuanFMontesinos,
In same code with slightly different setting (added 1 more gpu), I have this:
RuntimeError: CUDA out of memory. Tried to allocate 3.62 GiB (GPU 2; 47.99 GiB total capacity; 13.14 GiB already allocated; 31.59 GiB free; 13.53 GiB reserved in total by PyTorch)
In this case if we add 13.14+13.53=26.67GiB, we still have enough memory to allocate 3.62GiB.
And it doesn’t match your explanation.
I am monitoring using nvidia-smi, but the memory usage never reaches close to maximum.
In some other post, I saw that someone is calculating free memory as:
free memory = reserved memory - allocated memory
In this case, it makes sense.
But then I’m not sure how to reserve more memory, since it looks like it’s done automatically.
I’m also getting a similar type of error.
I have finetuned Yolov5 on custom dataset, it went very well, after getting best weights. I tried to refine those best weights by training for more 30 epochs. But I get cuda out of memory error.
I tried to empty cache memory as Suggested by stack overflow but still can’t get out of this error. Althought memory is not being used by the GPU. I have checked using Nvidia-smi
It says you have only 252.69MiB Cuda memory and you are trying to allocate 250.00MiB, which is pretty close. This makes sense to me that you don’t have enough GPU memory available.
In my case it tries to allocate 3.62GiB but there are 20.41GiB free GPU memory, which I think is weird.