I am using the PyTorch functions torch.rfft() and torch.irfft() inside the forward pass of a model. It runs fine on a single GPU. However, when I train the model on multiple GPUs, it fails with the error:
RuntimeError: cuFFT error: CUFFT_INTERNAL_ERROR
Does anybody have any intuition as to why this happens? Thanks!