Getting Triton to generate all kernels

Yeah all you need to do is set TORCH_LOGS="output_code" python train.py and you’ll get the kernels printed

1 Like