RuntimeError: cublas runtime error : the GPU program failed to execute at /pytorch/aten/src/THC/THCBlas.cu:450

torch.bmm raise this error , how could I solve this problem please

Could you post an executable code snippet as well as your setup (PyTorch, CUDA versions, used GPU etc.) so that we can take a look and try to reproduce this issue?