Modifying forward/backward pass

ptrblck January 6, 2023, 12:48am 2

Your use case sounds similar to CPU offloading, which uses torch.autograd.graph.saved_tensors_hooks or torch.autograd.graph.save_on_cpu if I’m not mistaken, so you could take a look at these context managers.

Does the pytorch has a tool to convert data from GPU to CPU and from CPU to GPU automatically when the GPU memory is not enough?

How do I rewrite the GPU memory allocation algorithm of PyTorch?

CPU and GPU memory

CUDA OOM because of tensor gradients?

How to Use More Process Memory

Can PyTorch move a tensor along with its computational graph from GPU to CPU, and then move it back to GPU for backpropagation?

Is it possible to directly write activation to CPU memory?

Instance segmentation on big images

Method for efficiently transferring non-autograd tensors to CPU from GPU?