Cuda is available only for a short time after reboot on PyTorch

scarecrow21 · September 14, 2019, 11:27am

OS: Ubuntu 18.04
GPU: Nvidia - RTX 2060, NVIDIA-SMI 430.26, Driver Version: 430.26, CUDA Version: 10.2
Laptop: Lenovo Legion Y540

I setup my GPU for PyTorch based on this post: https://forums.fast.ai/t/successful-ubuntu-18-04-with-igpu-for-xserver-and-nvidia-gpu-for-cuda-work-setup/20128/9

I have a hybrid setup. Intel Integrated GPU is used for display and NVIDIA for deep learning. Since CUDA comes bundled with PyTorch, I haven’t installed it separately. I have Installed pytorch using: conda install pytorch torchvision cudatoolkit=10.0 -c pytorch.

$ lspci | grep VGA
00:02.0 VGA compatible controller: Intel Corporation Device 3e9b
01:00.0 VGA compatible controller: NVIDIA Corporation Device 1f11 (rev a1)

torch.cuda.is_available() returns True only for a short while after I boot my machine. After that, it returns False.

I don’t understand what the issue is. Cuda seems to available for short window of time after I boot my machine. If I miss that window, I need to reboot to be able to use the GPU in PyTorch. Please help!

ptrblck · September 14, 2019, 11:12pm

Could you check some power settings and make sure the GPU isn’t deactivated to save battery power?

scarecrow21 · September 22, 2019, 6:31am

I did, I did not find any such setting. In the NVIDIA X Server application, the PRIME profile is set to ‘Intel (power save)’ but that is because I’m using the Intel card for display.
I will experiment some more and see in what cases does this happen. Will update this thread at a later time.