Heya, I’ve been doing some googling and it looks like the first execution of a TorchScript model includes a heavy warmup period. In my case, it’s almost 5 minutes.
This is pretty painful, and I’m wondering if I can save the optimized model directly so I don’t have to go through this warmup period every time.
Sorry to hear that. Unfortunately, we can’t save the warmed-up model today, because a number of optimizations rely on in-memory data structures that can’t be serialized. We have some longer-term infrastructural work planned to improve compile times, but it hasn’t landed yet.
Two things that would help us improve the situation:
We recently made a number of improvements to compilation and optimization time. Could you try the nightly build or the 1.7 RC and see whether you get an improvement? (Be sure to run the model a few times, since the first runs include the warmup.)
If possible, can you provide the .pt file for the model that is taking a really long time to warm up, as well as some example inputs? It will help us understand what sort of models are causing long compilation times.
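For point 1, a quick way to check warmup cost is to time several consecutive runs of the scripted model; the first run or two include JIT profiling and optimization, and later runs show steady-state latency. The sketch below uses a small stand-in module for illustration — for a real case you would replace it with `torch.jit.load("model.pt")` and your own example inputs:

```python
import time

import torch


# Tiny stand-in module; substitute torch.jit.load("model.pt") for a saved model.
class TinyModel(torch.nn.Module):
    def forward(self, x):
        return torch.relu(x @ x.t() + 1.0)


model = torch.jit.script(TinyModel())
x = torch.randn(64, 64)

# Time several consecutive runs; the first includes warmup (profiling/optimization).
for i in range(4):
    start = time.perf_counter()
    with torch.no_grad():
        out = model(x)
    print(f"run {i}: {time.perf_counter() - start:.4f}s")
```

Comparing the first timing against the later ones on the current release versus the nightly/1.7 RC would tell us whether the recent compile-time improvements help your model.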