What is cpu() in PyTorch?


correct += pred.eq(target.data).cpu().sum()

What’s the meaning of cpu()?
Can anyone explain that code?


This is used to move the tensor to the CPU. Some operations cannot be performed on CUDA tensors, so you need to move them to the CPU first.
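For example (a minimal sketch, assuming a CUDA-capable machine), converting a tensor to a NumPy array is one such operation: numpy() only works on CPU tensors, so a CUDA tensor has to be moved first.

import torch

if torch.cuda.is_available():
    t = torch.randn(3, device='cuda')
    # t.numpy() would raise a TypeError here, because NumPy arrays live in host memory
    arr = t.cpu().numpy()   # copy to the CPU first, then convert
    print(arr)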


tensor.cuda() is used to move a tensor to GPU memory.
tensor.cpu() moves it back to memory accessible to the CPU.
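A minimal sketch of the round trip (assuming a CUDA device is available):

import torch

t = torch.randn(2, 2)        # created in CPU memory
if torch.cuda.is_available():
    t_gpu = t.cuda()         # copy to GPU memory
    print(t_gpu.device)      # cuda:0
    t_cpu = t_gpu.cpu()      # copy back to CPU memory
    print(t_cpu.device)      # cpu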


But after calling tensor.cpu(), when I check the device of the tensor using tensor.device, it still shows the original cuda:0 from before the move. How can I be sure that the tensor has been moved to the CPU?

You have to reassign the tensor after moving it:

tensor = tensor.cpu()
# or using the newer .to() method
tensor = tensor.to('cpu')
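The reason is that these methods return a new tensor instead of modifying the original in place. A short sketch (assuming a CUDA device):

import torch

if torch.cuda.is_available():
    tensor = torch.randn(3, device='cuda')
    tensor.cpu()              # returns a new CPU tensor, which is discarded here
    print(tensor.device)      # still cuda:0
    tensor = tensor.cpu()     # keep the CPU copy by reassigning
    print(tensor.device)      # cpu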

Thanks @ptrblck. I see that .cpu() and .cuda() work differently for a model and a tensor. :+1:
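A minimal sketch of that difference (assuming a CUDA device): Module.cuda() moves the module’s parameters in place and returns the module itself, while Tensor.cuda() returns a new tensor and leaves the original untouched.

import torch
import torch.nn as nn

if torch.cuda.is_available():
    model = nn.Linear(4, 2)
    model.cuda()                               # parameters are moved in place
    print(next(model.parameters()).device)     # cuda:0

    t = torch.randn(4)
    t.cuda()                                   # result is discarded; t stays on the CPU
    print(t.device)                            # cpu
    t = t.cuda()                               # reassignment is needed for tensors
    print(t.device)                            # cuda:0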


Is there an official list of operations that cannot be performed on CUDA tensors?

I’m wondering if there is a guideline for understanding when you can and cannot perform certain operations.

Thanks!


I have a big model and my RAM is 6.
Is it helpful to move some tensors to the CPU while others remain on CUDA, to solve the CUDA out of memory error?

I don’t know what “my RAM is 6” means, but CPU offloading can generally reduce the GPU memory usage.
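A rough sketch of the idea (hypothetical sizes, assuming a CUDA device): intermediates that are not needed right away can be parked in host memory and copied back on demand, trading transfer time for GPU memory.

import torch

if torch.cuda.is_available():
    x = torch.randn(1024, 1024, device='cuda')
    cached = (x @ x).cpu()       # offload an intermediate result to host memory
    del x
    torch.cuda.empty_cache()     # release the cached GPU memory back to the device
    y = cached.cuda()            # copy it back to the GPU when it is needed again
    print(y.device)              # cuda:0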