but got `RuntimeError: CUDA out of memory. Tried to allocate 16.00 MiB (GPU 0; 5.80 GiB total capacity; 4.52 GiB already allocated; 4.56 MiB free; 4.53 GiB reserved in total by PyTorch)`
in this line: `model.to(device)`
As the error message describes, your GPU doesn't have enough free memory, so PyTorch cannot allocate the memory needed to push the model to the GPU.
Try to kill other processes which might be consuming GPU memory and rerun your script.
You could also try to `del` unneeded tensors before creating the model.
However, whether deleting tensors etc. from the device actually helps depends on your use case, i.e. on whether you still need them later.
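A minimal sketch of this idea (the tensor names and shapes are made up for illustration; in your script you would `del` whichever GPU tensors you no longer need):

```python
import torch

# Hypothetical intermediate tensor that is no longer needed.
# On your machine this would live on the GPU (e.g. created with .cuda()).
buf = torch.randn(1024, 1024)
result = buf.sum().item()  # keep only the small value you actually need

del buf  # drop the Python reference so the memory can be reused
if torch.cuda.is_available():
    torch.cuda.empty_cache()  # optionally return cached blocks to the driver

# Now create the model with the freed memory available.
model = torch.nn.Linear(10, 10)
```

Note that `empty_cache()` does not free tensors that are still referenced; it only releases PyTorch's cached, unused blocks.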
Where exactly should I `del` the tensor, please?
The error occurs exactly in this line: `vec = model(preprocess(image).unsqueeze(0).cuda())`. I used a counter after `vec1`, got the tensor for the first image, and then it threw this error.
It seems the error has now moved from the `model.to()` operation to the actual forward pass.
If the only data on the GPU are the model's parameters and the input tensor, your model might simply be too large for your device. You could try to use e.g. `torch.utils.checkpoint` to trade compute for memory.
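A small sketch of activation checkpointing with `torch.utils.checkpoint.checkpoint_sequential` (the model architecture and sizes here are made up; your actual model would replace it). Only the segment boundary activations are stored, and the rest are recomputed during backward:

```python
import torch
from torch.utils.checkpoint import checkpoint_sequential

# Toy deep sequential model standing in for your real model.
model = torch.nn.Sequential(
    *[torch.nn.Sequential(torch.nn.Linear(128, 128), torch.nn.ReLU())
      for _ in range(8)]
)
x = torch.randn(4, 128, requires_grad=True)

# Split the model into 4 segments; activations inside each segment
# are recomputed in the backward pass instead of being stored.
out = checkpoint_sequential(model, 4, x, use_reentrant=False)
out.sum().backward()
```

This reduces peak activation memory at the cost of roughly one extra forward pass during backward.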
I don’t know how high the peak memory usage for your entire workflow is, but I would assume that your system has more host RAM than the GPU (6GB), so it’s worth a try.
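One way to exploit the larger host RAM, sketched below under the assumption that your full dataset fits in host memory but not on the 6GB GPU: keep the data on the CPU and move only one batch at a time to the device, so peak GPU usage is the model plus a single batch. Shapes and batch size are illustrative.

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# Dataset stays in host RAM (made-up shape: 1000 RGB 32x32 images).
data = torch.randn(1000, 3, 32, 32)

model = torch.nn.Conv2d(3, 8, kernel_size=3).to(device)

for i in range(0, len(data), 100):
    # Only this batch occupies GPU memory at any point in time.
    batch = data[i:i + 100].to(device)
    out = model(batch)
```

With a `DataLoader` this pattern is the default: batches are loaded on the CPU and you move each one to the GPU inside the training loop.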