The full log is too large to paste here (a 45 MB text file), so it is available at the link below.
https://drive.google.com/file/d/1DRdwq3QjFbD6MyDZj0YHNtiuj2LFjqII/view?usp=sharing
An excerpt of the log is given below.
========= COMPUTE-SANITIZER
========= Invalid __global__ read of size 8 bytes
========= at 0x1470 in void cusparse::load_balancing_kernel<(unsigned int)256, (unsigned int)1, (unsigned long)0, int, int, cusparse::CsrMMOpAlg1<cusparse::CsrMMPolicyAlg1<int, double, double, double>, (bool)0, (bool)0, (bool)1, double, int>, int, double, double, double>(const T5 *, T4, T5, T5, int, const T4 *, T6, T7 *...)
========= by thread (64,0,0) in block (0,71,0)
========= Address 0x7f5a24c00000 is out of bounds
========= and is 1 bytes after the nearest allocation at 0x7f5a24a00000 of size 2,097,152 bytes
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x3050c2]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0x8fea4b]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/../../../../libcusparse.so.11
========= Host Frame: [0x95b5e8]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/../../../../libcusparse.so.11
========= Host Frame: [0x824b1]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/../../../../libcusparse.so.11
========= Host Frame: [0x4b9d79]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/../../../../libcusparse.so.11
========= Host Frame: [0x4f836d]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/../../../../libcusparse.so.11
========= Host Frame:cusparseSpMM [0xff1c9]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/../../../../libcusparse.so.11
========= Host Frame:void at::native::sparse::cuda::(anonymous namespace)::_csrmm2<double>(char, char, long, long, long, long, double*, double*, int*, int*, double*, long, double*, double*, long, cudaDataType_t) [clone .constprop.0] [0x2e8c222]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:void at::native::sparse::cuda::csrmm2<double>(char, char, long, long, long, long, double, double*, int*, int*, double*, long, double, double*, long) [0x2e8e2f5]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::native::s_addmm_out_csr_sparse_dense_cuda_worker(long, long, long, long, at::Tensor const&, c10::Scalar const&, at::Tensor const&, c10::Scalar const&, at::Tensor const&, at::Tensor const&, at::Tensor const&, at::Tensor const&)::{lambda()#1}::operator()() const::{lambda()#1}::operator()() const [0x2e7e920]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::native::s_addmm_out_csr_sparse_dense_cuda_worker(long, long, long, long, at::Tensor const&, c10::Scalar const&, at::Tensor const&, c10::Scalar const&, at::Tensor const&, at::Tensor const&, at::Tensor const&, at::Tensor const&) [0x2e818e4]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::native::s_addmm_out_sparse_dense_cuda_worker(long, long, long, long, at::Tensor&, c10::Scalar const&, at::Tensor const&, c10::Scalar const&, at::Tensor&, at::Tensor&, at::Tensor const&) [0x29639c1]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::native::s_addmm_out_sparse_dense_cuda(at::Tensor&, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x2964086]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::native::s_addmm_sparse_dense_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x2964880]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::native::addmm_sparse_dense_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x2964df7]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::(anonymous namespace)::(anonymous namespace)::wrapper_SparseCUDA__addmm(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x2d8833d]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:c10::impl::wrap_kernel_functor_unboxed_<c10::impl::detail::WrapFunctionIntoFunctor_<c10::CompileTimeFunctionPointer<at::Tensor (at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_SparseCUDA__addmm>, at::Tensor, c10::guts::typelist::typelist<at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&> >, at::Tensor (at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x2d883cd]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so
========= Host Frame:at::_ops::addmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x21195a1]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:at::native::_sparse_addmm(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x1daa50a]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:c10::impl::wrap_kernel_functor_unboxed_<c10::impl::detail::WrapFunctionIntoFunctor_<c10::CompileTimeFunctionPointer<at::Tensor (at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CompositeExplicitAutograd___sparse_addmm>, at::Tensor, c10::guts::typelist::typelist<at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&> >, at::Tensor (at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x28b86dd]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:at::_ops::_sparse_addmm::redispatch(c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x20af9b6]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:torch::autograd::VariableType::(anonymous namespace)::_sparse_addmm(c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x3c0a623]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:c10::impl::wrap_kernel_functor_unboxed_<c10::impl::detail::WrapFunctionIntoFunctor_<c10::CompileTimeFunctionPointer<at::Tensor (c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&), &torch::autograd::VariableType::(anonymous namespace)::_sparse_addmm>, at::Tensor, c10::guts::typelist::typelist<c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&> >, at::Tensor (c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x3c0afe3]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:at::_ops::_sparse_addmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) [0x2119231]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:at::native::_sparse_mm(at::Tensor const&, at::Tensor const&) [0x1daf233]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:c10::impl::wrap_kernel_functor_unboxed_<c10::impl::detail::WrapFunctionIntoFunctor_<c10::CompileTimeFunctionPointer<at::Tensor (at::Tensor const&, at::Tensor const&), &at::(anonymous namespace)::(anonymous namespace)::wrapper_CompositeImplicitAutograd___sparse_mm>, at::Tensor, c10::guts::typelist::typelist<at::Tensor const&, at::Tensor const&> >, at::Tensor (at::Tensor const&, at::Tensor const&)>::call(c10::OperatorKernel*, c10::DispatchKeySet, at::Tensor const&, at::Tensor const&) [0x2a83fb0]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:at::_ops::_sparse_mm::call(at::Tensor const&, at::Tensor const&) [0x23e3741]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so
========= Host Frame:torch::autograd::THPVariable__sparse_mm(_object*, _object*, _object*) [0x66183d]
========= in /home/srinath/miniconda3/envs/gpm/lib/python3.10/site-packages/torch/lib/libtorch_python.so
========= Host Frame:/usr/local/src/conda/python-3.10.13/Objects/methodobject.c:554:cfunction_call [0xfc697]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Objects/call.c:216:_PyObject_MakeTpCall [0xf614b]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/ceval.c:4181:_PyEval_EvalFrameDefault [0xf2376]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/ceval.c:5074:_PyEval_Vector [0x191d92]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/ceval.c:1135:PyEval_EvalCode [0x191cd7]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/pythonrun.c:1292:run_eval_code_obj [0x1c2967]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/pythonrun.c:1313:run_mod [0x1bdad0]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/pythonrun.c:1208:pyrun_file.cold [0x5956b]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/pythonrun.c:456:_PyRun_SimpleFileObject [0x1b805f]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Python/pythonrun.c:90:_PyRun_AnyFileObject [0x1b7dc3]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Modules/main.c:670:Py_RunMain [0x1b4b7d]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:/usr/local/src/conda/python-3.10.13/Modules/main.c:1091:Py_BytesMain [0x184e49]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
========= Host Frame:../sysdeps/nptl/libc_start_call_main.h:74:__libc_start_call_main [0x23a90]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame:../csu/libc-start.c:347:__libc_start_main [0x23b49]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x184cfe]
========= in /home/srinath/miniconda3/envs/gpm/bin/python
.
.
.
.
Invalid __global__ read of size 8 bytes
========= at 0x1470 in void cusparse::load_balancing_kernel<(unsigned int)256, (unsigned int)1, (unsigned long)0, int, int, cusparse::CsrMMOpAlg1<cusparse::CsrMMPolicyAlg1<int, double, double, double>, (bool)0, (bool)0, (bool)1, double, int>, int, double, double, double>(const T5 *, T4, T5, T5, int, const T4 *, T6, T7 *...)
========= by thread (64,0,0) in block (0,71,0)
========= Address 0x7f5a24c00000 is out of bounds
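
As a quick sanity check on the numbers in the report above, the faulting address can be compared against the nearest allocation. The sketch below only does the arithmetic on values copied from the log. The function name `describe_fault` is hypothetical, and treating the buffer as an array of 8-byte elements is an assumption based on the reported read size and the `double` template arguments of the kernel, not something the log states directly.

```python
# Values copied verbatim from the compute-sanitizer excerpt above.
FAULT_ADDR = 0x7F5A24C00000   # "Address 0x7f5a24c00000 is out of bounds"
ALLOC_BASE = 0x7F5A24A00000   # "nearest allocation at 0x7f5a24a00000"
ALLOC_SIZE = 2_097_152        # "of size 2,097,152 bytes"
READ_SIZE = 8                 # "Invalid __global__ read of size 8 bytes"


def describe_fault(fault_addr, alloc_base, alloc_size, elem_size):
    """Relate a faulting address to the nearest allocation (hypothetical helper).

    Assumes the allocation is a flat array of `elem_size`-byte elements.
    """
    offset = fault_addr - alloc_base
    return {
        "offset_bytes": offset,                  # distance from start of allocation
        "bytes_past_end": offset - alloc_size,   # 0 means first byte after the buffer
        "element_index": offset // elem_size,    # index the kernel tried to read
        "element_count": alloc_size // elem_size # number of valid elements
    }


info = describe_fault(FAULT_ADDR, ALLOC_BASE, ALLOC_SIZE, READ_SIZE)
for key, value in info.items():
    print(f"{key}: {value}")
```

Under that assumption, the read starts exactly at the end of the 2 MiB allocation: the element index equals the element count, so the kernel appears to be reading one element past the last valid one rather than touching an unrelated address.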