CUDA out of memory. How to fix?

How do I clear the GPU cache?
torch.cuda.empty_cache() doesn’t work

The first question I would ask is how many GPUs you have.
For instance, if you have 8 GPUs, then torch.cuda.empty_cache() should be expanded to something like this:

import torch

how_many_gpus = torch.cuda.device_count()
for gpu_id in range(how_many_gpus):
    torch.cuda.set_device(gpu_id)
    torch.cuda.empty_cache()

What have you tried so far?

I have an NVIDIA GeForce GTX 1060 6GB.

.empty_cache() will only clear the cache if no references to the data are stored anymore. If you don’t see any memory released after the call, you would have to delete some tensors first.
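As a minimal sketch of that order of operations (the tensor here is just an illustrative placeholder, not from any particular project):

```python
import torch

if torch.cuda.is_available():
    x = torch.randn(1024, 1024, device="cuda")  # some tensor holding GPU memory
    del x                      # drop the last reference first...
    torch.cuda.empty_cache()   # ...only then can the cached blocks be released
```

If `x` were still referenced anywhere (a list, a closure, the autograd graph), `empty_cache()` could not return its block to the driver.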

How can I clear the cache? I am using this: https://github.com/avinashpaliwal/Super-SloMo

I tried this and got:
TypeError: device_count() takes 0 positional arguments but 1 was given

What is wrong with this?

import torch
how_many_gpus = torch.cuda.device_count()
for _ in range(how_many_gpus):
        torch.cuda.set_device("cuda0")
        torch.cuda.empty_cache()

Did you mean to say that you don’t have multiple GPUs? Can you confirm?

Please check out the CUDA semantics document.

Instead of torch.cuda.set_device("cuda0"), I would use torch.cuda.set_device("cuda:0"), since "cuda0" is not a valid device string. But in general the code you provided in your last update, @Mr_Tajniak, would not work for the case of multiple GPUs: it sets the same device on every iteration instead of using the loop index.

In case you have a single GPU (which I would assume based on your hardware), what @ptrblck said applies:

.empty_cache() will only clear the cache if no references to the data are stored anymore. If you don’t see any memory released after the call, you would have to delete some tensors first.

This basically means torch.cuda.empty_cache() would clear PyTorch’s cache area on the GPU.

You can check the size of this area with this code:

import torch

def p():
    c = torch.cuda.memory_cached()     # renamed to memory_reserved() in PyTorch >= 1.4
    a = torch.cuda.memory_allocated()
    print(f'cached   :{c}')
    print(f'allocated:{a}')
    print(f'free     :{c - a}')

torch.cuda.empty_cache()
p()
r = torch.randn(1, 128).cuda()
p()

Out:

cached   :0
allocated:0
free     :0
cached   :2097152
allocated:512
free     :2096640

See how PyTorch allocated 2 MB of cache just for storing these 128 floats. If you del r followed by p(), the allocated memory drops to zero and the whole cached block is reported as free again.

If there are objects you haven’t deleted, make sure you delete them if they are no longer needed.

Why did PyTorch cache the memory in advance?
To reuse it later. That is the idea of a cache: we assume this precious resource will be needed again.
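A small experiment to see the reuse in action (guarded so it only runs where a GPU is present; memory_reserved() is the newer name for memory_cached()):

```python
import torch

if torch.cuda.is_available():
    torch.cuda.empty_cache()
    r = torch.randn(1, 128, device="cuda")
    cached_before = torch.cuda.memory_reserved()  # the 2 MB block from above
    del r
    # The block stays in the cache, so an equally sized tensor
    # can reuse it instead of triggering a new cudaMalloc.
    r2 = torch.randn(1, 128, device="cuda")
    cached_after = torch.cuda.memory_reserved()
    print(cached_before, cached_after)  # typically equal: the cached block was reused
```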

I have single GPU
GeForce GTX 1060 6GB

So, to free the GPU cache, I have to enter:

import torch
p()
del r

Yes?

@Mr_Tajniak, all I wanted to say is that you need to be smart about the variables that consume GPU memory.
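If it is not obvious which variables are holding GPU memory, one common debugging trick (a sketch, not part of the project you linked) is to walk Python’s garbage collector and list the live CUDA tensors:

```python
import gc
import torch

def live_cuda_tensors():
    """Return (type, shape) for every tensor currently alive on the GPU."""
    found = []
    for obj in gc.get_objects():
        try:
            if torch.is_tensor(obj) and obj.is_cuda:
                found.append((type(obj).__name__, tuple(obj.shape)))
        except Exception:
            pass  # some objects raise on attribute access; skip them
    return found
```

Anything this returns is a reference that empty_cache() cannot release; del it (or let it go out of scope) first.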

I haven’t tested the project you mentioned, but I just saw you created the issue.

You can expect feedback there. Check whether the OOM still occurs if you use a smaller video format, and check the GPU memory requirements for the frame size you use.
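For the frame-size angle, one pattern worth trying (the function and names here are hypothetical, not from the Super-SloMo code) is to catch the OOM and retry at a smaller scale:

```python
import torch
import torch.nn.functional as F

def run_with_fallback(process, frame, scales=(1.0, 0.5, 0.25)):
    """Try progressively smaller frames until one fits in GPU memory.

    `process` stands in for the model call and `frame` for an
    (N, C, H, W) input tensor; the retry-on-OOM pattern is the point.
    """
    for s in scales:
        small = frame if s == 1.0 else F.interpolate(
            frame, scale_factor=s, mode="bilinear", align_corners=False)
        try:
            return process(small)
        except RuntimeError as e:
            if "out of memory" not in str(e):
                raise
            torch.cuda.empty_cache()  # drop cached blocks before retrying smaller
    raise RuntimeError("still out of memory at the smallest scale")
```

A lower-resolution output may be acceptable for testing whether memory is really the bottleneck.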

OK, thanks for the help. In case of problems I will contact you, ok?


Sure. Why not.