Will DataLoader deallocate the CPU memory after training on each batch?

Hi!

In my experience, when training on a Dataset with PyTorch, the CPU memory usage stays stable. Is that because the DataLoader automatically deallocates the CPU memory occupied by the previous batch of tensors? How does that happen? Where can I find more details about the DataLoader?

This is handled automatically by Python’s garbage collection. When a Tensor is no longer referenced anywhere, it will be collected (not necessarily right after the batch has been used) and its memory will be freed.
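A minimal sketch of why memory stays flat in a typical training loop (the toy `TensorDataset` below is just an illustration, not from the original post): rebinding the loop variables each iteration drops the last reference to the previous batch, so its CPU memory can be reclaimed without any explicit call.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Toy dataset: 1000 samples of 1000 floats each (hypothetical example).
dataset = TensorDataset(torch.randn(1000, 1000), torch.randint(0, 10, (1000,)))
loader = DataLoader(dataset, batch_size=100)

for inputs, targets in loader:
    # At the top of each iteration, `inputs` and `targets` are rebound to the
    # new batch. The previous batch's tensors lose their last reference here
    # (assuming nothing else still points to them), so Python frees them.
    loss = inputs.sum() + targets.float().sum()  # placeholder "work"

# Pitfall: keeping references to batch tensors (or to tensors that hold onto
# them, e.g. a loss with its autograd graph) prevents this memory from being
# released. Store plain Python numbers instead:
history = []
for inputs, targets in loader:
    loss = inputs.sum()
    history.append(float(loss))  # detached scalar, so the batch can be freed
```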
