Hi, our machine was restarted and PyTorch can no longer use CUDA, although it worked before the restart. When I run `torch.cuda.device_count()`, I get the following error:
/home/username/miniconda3/envs/tvae/lib/python3.8/site-packages/torch/cuda/__init__.py:80: UserWarning: CUDA initialization: CUDA unknown error - this may be due to an incorrectly set up environment, e.g. changing env variable CUDA_VISIBLE_DEVICES after program start. Setting the available devices to be zero. (Triggered internally at /home/conda/feedstock_root/build_artifacts/pytorch-recipe_1635023442742/work/c10/cuda/CUDAFunctions.cpp:112.)
  return torch._C._cuda_getDeviceCount() > 0
Below is my PyTorch version, as well as the output of `nvidia-smi` and `nvcc -V`. Please let me know if you need any additional information.
Potentially relevant `nvidia-smi` output:
`Fri Jun 10 12:27:40 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 455.23.05    Driver Version: 455.23.05    CUDA Version: 11.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Quadro RTX 6000     On   | 00000000:17:00.0 Off |                  Off |
| 34%   27C    P8    17W / 260W |      6MiB / 24220MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
|   1  Quadro RTX 6000     On   | 00000000:65:00.0  On |                  Off |
| 34%   30C    P8     4W / 260W |    131MiB / 24212MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A     10225      G   /usr/lib/xorg/Xorg                  4MiB |
|    0   N/A  N/A     10579      G   /usr/bin/gnome-shell                0MiB |
|    1   N/A  N/A     10225      G   /usr/lib/xorg/Xorg                101MiB |
|    1   N/A  N/A     10579      G   /usr/bin/gnome-shell               28MiB |
+-----------------------------------------------------------------------------+`
Output of `nvcc -V`:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Sun_Jul_28_19:07:16_PDT_2019
Cuda compilation tools, release 10.1, V10.1.243
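In case it helps with reproducing this, here is a minimal sketch of a script that collects the PyTorch-side details in one place. The helper name `collect_cuda_info` is made up for illustration and is not part of PyTorch. It only relies on `torch.__version__`, `torch.version.cuda`, `torch.cuda.is_available()` and `torch.cuda.device_count()`, and it assumes the warning above is raised through Python's `warnings` module so that it can be captured.

```python
import os
import warnings


def collect_cuda_info():
    """Gather PyTorch/CUDA details for a bug report (hypothetical helper)."""
    # An unexpected value here is one of the causes the warning itself mentions.
    info = {"CUDA_VISIBLE_DEVICES": os.environ.get("CUDA_VISIBLE_DEVICES")}

    try:
        import torch
    except ImportError:
        # PyTorch is not installed in this environment; report that and stop.
        info["torch_version"] = None
        return info

    info["torch_version"] = torch.__version__
    # CUDA version this PyTorch build was compiled against (None for CPU-only builds).
    info["torch_built_with_cuda"] = torch.version.cuda

    # Record any warnings emitted during CUDA initialization instead of
    # letting them scroll past on stderr.
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        info["cuda_available"] = torch.cuda.is_available()
        info["device_count"] = torch.cuda.device_count()
    info["warnings"] = [str(w.message) for w in caught]
    return info


if __name__ == "__main__":
    for key, value in collect_cuda_info().items():
        print(f"{key}: {value}")
```

Note that `torch.version.cuda` reports the CUDA version the PyTorch build was compiled against, which can differ from the toolkit version that `nvcc -V` prints.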