When I compile the model using max_autotune model, he is giving me this warning : [WARNING] not enough SMs to use max_autotune_gemm mode and i don't know how to exploit thi feature.
PS: I’m working on complex- valued models model with a ComplexConvolution class that uses two nn.Conv2D (for real and imaginary part )
The warning seems to be raised here, which assumes your GPU needs at least 80 SMs for this mode, which seems to be a hard-coded limitation in torch.compile.