CUBLAS_STATUS_EXECUTION_FAILED when calling `cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)`

Hello @ptrblck ,

I have an RTX2070 GPU and I tried installing torch with cuda via conda, by following the instructions from here https://pytorch.org/.
I tried with both cuda 10.2 and cuda 11.1. I also tried installing the conda packages or to directly install the packages via pip, in the conda env.
I am now trying to install from source but I am stuck here as well with this problem :frowning: