PyTorch cannot find GPU

Shangtong_Zhang · June 1, 2018, 3:52am

I tried to install pytorch on a cluster, which is centos 6.9 and I don’t have a root account. The default glibc is too old, so I first installed a newer glibc version and compiled python myself.
I start my python with

/cshome/shangton/apps/glibc/lib/ld-2.17.so --library-path /cshome/shangton/apps/glibc/lib:/cshome/shangton/apps/python3/lib:/lib64:/cshome/shangton/miniconda3/lib:/usr/local/cuda-8.0/lib64 /cshome/shangton/apps/python3/bin/python3

But torch doesn’t seem to find the device,

Python 3.6.3 (default, May 31 2018, 20:42:57)
[GCC 4.4.7 20120313 (Red Hat 4.4.7-17)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>> torch.cuda.device_count()
0
>>>

ptrblck · June 1, 2018, 9:47am

Did you see any information regarding CUDA or your GPU during the build from source?
Were any warnings thrown?

Shangtong_Zhang · June 1, 2018, 2:47pm

Oh I didn’t build from source, just pip install torch (or specify a link with cuda myself)
I will try to build it from source.

ptrblck · June 1, 2018, 2:50pm

Sorry, I thought you already compiled PyTorch from source.
Anyway, did you make sure to use the compiled python version?
You can check it with which python3.

Shangtong_Zhang · June 1, 2018, 9:21pm

I use absolute path to call python (the long command in the first line)

Shangtong_Zhang · June 1, 2018, 10:07pm

I finally made it,

/cshome/shangton/apps/glibc/lib/ld-2.17.so --library-path /cshome/shangton/apps/glibc/lib:/cshome/shangton/apps/python3/lib:/lib64:/cshome/shangton/miniconda3/lib:/usr/local/cuda-8.0/lib64:/usr/lib64 /cshome/shangton/apps/python3/bin/python3

Add one more lib path