Hi I have this error of :
RuntimeError: CUDA error: CUBLAS_STATUS_EXECUTION_FAILED when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc)
Thanks for the information. I cannot reproduce this issue on a V100 with CUDA11.2 and am not sure what the PyTorch version 19.09 would refer to.
Are you using an NGC container of this old version? If so, it wouldn’t ship with CUDA11.2, so could you post an update about your setup, please?