I’m using Automatic Mixed precision and within my model’s forward pass I make use of torch.fft.rfft2() and torch.fft.irfft2(). For some reason this seems to work find on my laptop, but when using a remote node via slurm throws the following error
RuntimeError: cuFFT only supports dimensions whose sizes are powers of two when computing in half precision
As far as I can tell I’m using the same versions of pytorch on each
Remote:
pytorch 2.5.1 py3.12_cuda12.4_cudnn9.1.0_0 pytorch
pytorch-cuda 12.4 hc786d27_7 pytorch
Laptop:
pytorch 2.5.1 py3.12_cuda12.4_cudnn9_0 pytorch
pytorch-cuda 12.4 h3fd98bf_7 pytorch
I could just increase the padding to next power of 2….however this drastically increases the size of the two matrices I’m then looking to multiply together.