Why torch.compile slows down from second run?

Nirmai · June 6, 2023, 2:23am

The recently released torch 2.0 works great, torch.compile() makes models to run faster. I have tried few experiments where I observed the first run is taking time for compilation and from the second run it speeds up the model.

What is the reason behind it? Even for the second run if we instantiate the model and run the model again from start it takes lesser time. Is torch.compile() stores something at backend?

eqy · June 6, 2023, 3:36am

If you are saying that “compiling” the model again takes less time, then I believe this is expected as the compiled kernels are cached: PT 2.0 - Are compiled models savable - #2 by smth.

Nirmai · June 6, 2023, 6:13am

Is there a way to access this cache, from where can we do this?

eqy · June 6, 2023, 6:22am

I would see if e.g., torch._dynamo — PyTorch 2.0 documentation does what you are looking for

Nirmai · June 7, 2023, 4:03am

The link above shows how to reset the cache, is there any way to save this cache and use it back by pickling?

adal · October 13, 2023, 9:00am

I met with the same problem. Are there any updates?