RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling `cublasCreate(handle)`

@ptrblck
Hello, I am facing a similar problem, and I created a topic fo rit, I would love your help:
https://discuss.pytorch.org/t/facebook-bart-fine-tuning-transformers-cuda-error-cublas-status-not-initialize/178641