Why torch.compile slows down from second run?

If you are saying that “compiling” the model again takes less time, then I believe this is expected as the compiled kernels are cached: PT 2.0 - Are compiled models savable - #2 by smth.