Hi everyone, I tried to run the pointnet module on Pytorch but got the following error:
RuntimeError: cublas runtime error : the GPU program failed to execute at /opt/conda/conda-bld/pytorch_1549635019666/work/aten/src/THC/THCBlas.cu:258
It shows the error comes from F.linear(input, self.weight, self.bias).
The whole error message is:
==============
Traceback (most recent call last):
File “/home/yirus/software/pycharm-community-2018.3.4/helpers/pydev/pydevd.py”, line 1741, in
main()
File “/home/yirus/software/pycharm-community-2018.3.4/helpers/pydev/pydevd.py”, line 1735, in main
globals = debugger.run(setup[‘file’], None, None, is_module)
File “/home/yirus/software/pycharm-community-2018.3.4/helpers/pydev/pydevd.py”, line 1135, in run
pydev_imports.execfile(file, globals, locals) # execute the script
File “/home/yirus/software/pycharm-community-2018.3.4/helpers/pydev/_pydev_imps/_pydev_execfile.py”, line 18, in execfile
exec(compile(contents+"\n", file, ‘exec’), glob, loc)
File “/home/yirus/Projects/pointnet.pytorch/train_classification.py”, line 78, in
pred, _ = classifier(points)
File “/home/yirus/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py”, line 489, in call
result = self.forward(*input, **kwargs)
File “/home/yirus/Projects/pointnet.pytorch/pointnet.py”, line 100, in forward
x, trans = self.feat(x)
File “/home/yirus/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py”, line 489, in call
result = self.forward(*input, **kwargs)
File “/home/yirus/Projects/pointnet.pytorch/pointnet.py”, line 100, in forward
x, trans = self.feat(x)
File “/home/yirus/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py”, line 489, in call
result = self.forward(*input, **kwargs)
File “/home/yirus/Projects/pointnet.pytorch/pointnet.py”, line 73, in forward
trans = self.stn(x)
File “/home/yirus/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py”, line 489, in call
result = self.forward(*input, **kwargs)
File “/home/yirus/Projects/pointnet.pytorch/pointnet.py”, line 47, in forward
x = F.relu(self.bn4(self.fc1(x)))
File “/home/yirus/anaconda3/lib/python3.7/site-packages/torch/nn/modules/module.py”, line 489, in call
result = self.forward(*input, **kwargs)
File “/home/yirus/anaconda3/lib/python3.7/site-packages/torch/nn/modules/linear.py”, line 67, in forward
return F.linear(input, self.weight, self.bias)
File “/home/yirus/anaconda3/lib/python3.7/site-packages/torch/nn/functional.py”, line 1352, in linear
ret = torch.addmm(torch.jit._unwrap_optional(bias), input, weight.t())
==============
I tried to search on the Internet but cannot find any posts that are helpful.
======================
Here is the versions installed:
GPU info: GeForce RTX 2080
$ nvidia-smi
It shows “Driver Version: 415.18, CUDA version: 10.0”
$nvcc -V
It shows “Cuda compilation tools, release 9.2, V9.2.148”
Pytorch version: 1.0.1.post2 (installed by anaconda3)
Operation system: Fedora 28
Python version: 3.7
I have a feeling that it’s related to the wrong version of CUDA or should I reinstall pytorch?
Thanks in advance!