torch.compile: [WARNING] not enough SMs to use max_autotune_gemm mode

anass_017 · July 17, 2023, 10:08am

I have a small model with about 900K weights:

compiled_model = torch.compile(net_model, backend="inductor", mode="max-autotune")

When I compile the model with mode="max-autotune", I get this warning:
[WARNING] not enough SMs to use max_autotune_gemm mode
I don't know how to take advantage of this feature.

PS: I'm working on complex-valued models, with a ComplexConvolution class that uses two nn.Conv2d layers (one for the real part and one for the imaginary part).

ptrblck · July 17, 2023, 4:05pm

The warning seems to be raised here. The check assumes your GPU has at least 80 SMs for this mode, which appears to be a hard-coded limit in torch.compile.
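
If you want to check how many SMs your GPU reports, something like the sketch below might help. It assumes the 80-SM threshold mentioned above (the exact value may differ between PyTorch versions), and `has_enough_sms` and `report_gpu_sms` are hypothetical helper names, not part of PyTorch. Only `torch.cuda.is_available` and `torch.cuda.get_device_properties` are actual library calls.

```python
# Assumed threshold, taken from the explanation above; it may differ
# between PyTorch versions, so treat it as an assumption.
ASSUMED_MIN_SMS = 80


def has_enough_sms(sm_count: int, min_sms: int = ASSUMED_MIN_SMS) -> bool:
    """Return True if the SM count meets the assumed threshold."""
    return sm_count >= min_sms


def report_gpu_sms(device_index: int = 0) -> None:
    """Print the SM count of a CUDA device and compare it to the threshold."""
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed")
        return
    if not torch.cuda.is_available():
        print("No CUDA device available")
        return
    props = torch.cuda.get_device_properties(device_index)
    sms = props.multi_processor_count
    verdict = "meets" if has_enough_sms(sms) else "is below"
    print(f"{props.name}: {sms} SMs, which {verdict} the assumed "
          f"threshold of {ASSUMED_MIN_SMS}")


if __name__ == "__main__":
    report_gpu_sms()
```

If the reported count is below the threshold, the warning is expected on that GPU.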

anass_017 · July 20, 2023, 9:32am

OK, I get it. My GPU has only 48 streaming multiprocessors. Thank you for your response.

Prakhar · May 30, 2024, 7:29am

Did you manage to resolve this and use the max-autotune mode?