Just wanted to ask: if I use
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
or device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu'), and then move my model and the tensors I want to use to that device via .to(device), am I only using 1 GPU?
Yes, both approaches would use a single GPU only.
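A minimal sketch of what that looks like in practice (the Linear model and tensor shapes are just placeholders); it falls back to the CPU when no GPU is visible:

```python
import torch

# Pick a single device: the first GPU if one is available, otherwise the CPU.
# 'cuda' and 'cuda:0' are equivalent here, since 'cuda' refers to the current
# device, which defaults to device 0.
device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')

model = torch.nn.Linear(10, 2).to(device)  # moves parameters in place
x = torch.randn(4, 10).to(device)          # returns a copy on the device

out = model(x)
```

Note that .to(device) is in-place for nn.Module parameters but returns a new tensor for plain tensors, so the result has to be reassigned for data.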
Thanks. Would this still be the case if I had multiple GPUs available to PyTorch?
Yes, this would still be the case and you could use
nn.DistributedDataParallel to use multiple devices.
This tutorial gives you more information.
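For reference, a minimal DistributedDataParallel sketch might look roughly like the following. This is a single-process, gloo-backend version so it also runs on CPU; the address, port, and model are placeholders, and a real multi-GPU setup would launch one process per device (e.g. via torchrun) and pass device_ids=[rank]:

```python
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

# DDP needs a process group; for this single-process sketch we use the
# gloo backend with a local rendezvous address.
os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
dist.init_process_group("gloo", rank=0, world_size=1)

model = torch.nn.Linear(10, 2)
ddp_model = DDP(model)  # no device_ids -> CPU; use device_ids=[rank] on GPUs

out = ddp_model(torch.randn(4, 10))

dist.destroy_process_group()
```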
Thanks @ptrblck. Assuming I have at least 1 GPU, by default would the same GPU be used in both approaches?
If your system has only a single GPU, I think both approaches should fall back to using the single device.
Thanks @ptrblck. (referring to my original question) Just to confirm would the 2 approaches I specified at the top of the thread only use 1 GPU even if torch.cuda.device_count() gave a number greater than 1?
Yes, if you are explicitly moving the model and data to a specific device, only this device will be used.
That will be the case as long as you don’t use e.g. nn.DataParallel or nn.DistributedDataParallel, which replicate the model across devices.
Note that (depending on your code) PyTorch might create a CUDA context on other visible devices.
If you see this behavior and want to avoid it, you could expose only the desired device (masking all others) via
CUDA_VISIBLE_DEVICES=0 python script.py args.
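The same masking can be done from inside the script, as long as the variable is set before PyTorch initializes CUDA, ideally before importing torch at all (the empty string below hides all GPUs; setting it to "0" would expose only the first one):

```python
import os

# Must be set before torch touches CUDA; "" hides every GPU,
# "0" would expose only device 0 (which then appears as cuda:0).
os.environ["CUDA_VISIBLE_DEVICES"] = ""

import torch

print(torch.cuda.device_count())  # 0: no devices visible, so no stray contexts
```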
Thanks @ptrblck (again referring to my original question): if I have multiple GPUs on the system where I execute my code, the 2 approaches will only use 1 GPU?
Also, what exactly is a CUDA context, and is it something I should avoid?
The CUDA context stores the GPU kernels, runtime etc., and thus uses memory on the specified device.
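You can see this lazily created, per-device state yourself; in this guarded sketch the context memory shows up in nvidia-smi, while PyTorch's own tensor allocation counter stays at zero because no tensors have been created yet:

```python
import torch

if torch.cuda.is_available():
    torch.cuda.init()  # forces context creation on the current device
    # The context costs device memory that nvidia-smi reports, but
    # torch.cuda.memory_allocated() only tracks tensor allocations.
    print(torch.cuda.memory_allocated(0))  # 0 until tensors are created
```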
Thanks @ptrblck. Should cuda contexts be avoided?
No, as it’s holding the CUDA kernels, runtime etc. If you don’t initialize it, you won’t be able to run code on your GPU.
Thanks @ptrblck. So having CUDA contexts on GPUs I don’t move anything to is fine?