Error: identifier "THCudaBlas_Sgemv" is undefined

gouthamvgk · July 24, 2020, 10:50pm

I am trying to compile DCNv2 layer from this repo https://github.com/CharlesShang/DCNv2. While doing so my compilation fails with this error in stack trace “src/cuda/dcn_v2_cuda.cu(324): error: identifier “THCudaBlas_Sgemv” is undefined”. Any help??

ptrblck · July 26, 2020, 9:18am

It seems, the method was already updated as seen here.

gouthamvgk · July 28, 2020, 2:59pm

Seems that THCudaBlas_Sgemv has been moved to at::cuda::blas::gemv. But after compiling the dcn with the new API linkage error shows up like undefined symbol: _ZN2at4cuda4blas4gemvIfEEvcllT_PKS3_lS5_lS3_PS3_l . After going through torch repo i couldnt find the definition of at::cuda::blas::gemv anywhere. Am I missing something??

ptrblck · July 29, 2020, 4:52am

The method definition starts in this line of code and defines different dtypes etc.

Some examples on its usage are e.g. in SpatialConvolutionMM.cu.

lbin · July 29, 2020, 12:26pm

Try this https://github.com/lbin/DCNv2/tree/pytorch_1.6

Felix_Lessange · February 5, 2021, 11:09am

Using nvidia tensorrt 20.11 image and pytorch 1.7.1,
Running this

find . -type f -exec sed -i 's/THCState_getCurrentStream(state)/c10::cuda::getCurrentCUDAStream()/g' {} \;

And replace external/DCNv2/src/cuda/dcn_v2_cuda.cu file by this version DCNv2/dcn_v2_cuda.cu at c916fdf2ed69ebebc43cbe8a2cb5e7ce0dd71132 · lbin/DCNv2 · GitHub

I manage to get a successful build, but I have the same issue mentionned above when importing dcn_v2 :

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "yolact/external/DCNv2/dcn_v2.py", line 13, in <module>
    import _ext as _backend
ImportError: yolact/external/DCNv2/_ext.cpython-36m-x86_64-linux-gnu.so: undefined symbol: _ZN2at4cuda4blas4gemvIfEEvcllT_PKS3_lS5_lS3_PS3_l