This is because the CUDA backend uses a caching allocator. This means that the memory is freed but kept by the allocator rather than returned to the device, so nvidia-smi still reports it as in use.
If, after running del test, you allocate more memory with test2 = torch.Tensor(1000, 1000), you will see that the memory usage stays exactly the same: PyTorch did not re-allocate memory but re-used the block that was freed when you ran del test.
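A quick way to see this is to compare torch.cuda.memory_allocated() with torch.cuda.memory_reserved(). A minimal sketch (I use torch.zeros(..., device="cuda") so the tensor actually lives on the GPU; the exact byte counts will vary on your machine):

import torch

test = torch.zeros(1000, 1000, device="cuda")   # ~4 MB of GPU memory
print(torch.cuda.memory_allocated())            # bytes in use by live tensors
print(torch.cuda.memory_reserved())             # bytes held by the caching allocator

del test
print(torch.cuda.memory_allocated())            # drops: the tensor is gone
print(torch.cuda.memory_reserved())             # unchanged: the block stays in the cache

test2 = torch.zeros(1000, 1000, device="cuda")  # re-uses the cached block
print(torch.cuda.memory_reserved())             # still unchanged: no new allocation from the device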
Hi @albanD,
I am using different models for inference, but my GPU memory is only 10 GB.
How do I unload a model from cuda and switch/load another model to cuda?
Doing model.cpu() will move it back to the CPU.
Assuming that nothing else references the weights, they will be freed and returned to our allocator to be used for other Tensors.
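A minimal sketch of the swap, with two small nn.Sequential stand-ins in place of your real models (model_a and model_b are just placeholder names; any nn.Module behaves the same way):

import torch
import torch.nn as nn

model_a = nn.Sequential(nn.Linear(4096, 4096), nn.ReLU(), nn.Linear(4096, 4096))
model_b = nn.Sequential(nn.Linear(4096, 4096), nn.ReLU(), nn.Linear(4096, 4096))
x = torch.randn(8, 4096)

with torch.no_grad():
    model_a.cuda()                 # load the first model onto the GPU
    out_a = model_a(x.cuda())      # run inference

    model_a.cpu()                  # move its weights back to host memory
    torch.cuda.empty_cache()       # optional: return cached blocks to the device
                                   # so nvidia-smi also reflects the freed memory

    model_b.cuda()                 # the second model now fits in the freed space
    out_b = model_b(x.cuda())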
Here’s a minimal example showing that memory is not freed. I want to completely free the tensor memory on the GPU, and to be able to see that reflected in nvidia-smi:
import os
os.environ["CUDA_VISIBLE_DEVICES"] = "6"
import torch
from transformers import LlamaForCausalLM
model = LlamaForCausalLM.from_pretrained(
    pretrained_model_name_or_path='decapoda-research/llama-7b-hf',
    load_in_8bit=True,
    device_map={'': 0},
)
del model
torch.cuda.empty_cache()
print('breakpoint here - is memory freed?')
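In place of the breakpoint, the allocator statistics can also be printed directly. A small sketch appended to the script above (gc.collect() is just an extra attempt to drop any lingering references before emptying the cache):

import gc

gc.collect()                                    # drop lingering Python references
torch.cuda.empty_cache()                        # release cached blocks back to the device

print('allocated:', torch.cuda.memory_allocated())  # bytes held by live tensors
print('reserved: ', torch.cuda.memory_reserved())   # bytes kept by the caching allocator;
                                                    # nvidia-smi roughly tracks this plus the CUDA context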