Cross-platform PyTorch stand-alone executables

Deeply · July 2, 2019, 11:45am

At this stage of my project, I need to compile my code to binaries that work under Windows and / or Ubuntu. I know that jit is a great mile stone in PyTorch, but, I am not sure if this is the correct approach that I can use to achieve my objectives; not to mention the complexity and effort needed to use jit; or PyTorch v1.0 C++ Front-End. That said, there are some tools to convert Python to binaries, for example, PyInstaller , which are not compatible with PyTorch. Are there any other alternatives to do PyTorch code compilation?

Thanks in advance

driazati · July 2, 2019, 7:57pm

What difficulties are you seeing with TorchScript (aka jit)? We are trying to make it as easy to use as possible for models defined in Python with PyTorch, so if you’re having issues please let us know. It appears that PyInstaller includes a Python runtime inside the binary, whereas TorchScript and the C++ frontend both have no dependency on Python and can be run in multithreaded environments with no GIL.

As for TorchScript vs. the C++ frontend, that’s a more personal choice as they both rely on libtorch and the difference mostly comes down to how you want to define your model (in Python + TorchScript vs in C++), so it’s hard to say without more information on your use case.

Deeply · July 2, 2019, 9:24pm

Thanks for your answer, which saved me putting lots of sparse effort.

To be honest, I have not tried jit yet; but I need to know which path to take before going forward with it.

From your answer I can say, as I am using PyTorch, then, it’s better to use jit.

soldierofhell · July 14, 2019, 1:24pm

@driazati, but if one needs to (1) train C++ and then (2) serialize model and then (3) load in C++ on the production, do you have to go through TorchScript in (2)? (Ok, I know that there’s ONYX)

driazati · July 16, 2019, 5:34pm

The C++ frontend is a high level API that provides a similar experience as using PyTorch’s Python API, as such it has its own serialization mechanisms, so you can iterate on your model all in C++, this example shows how.

TorchScript lets you go from PyTorch models coded in Python to something that you can load in C++, so it’s not necessary for that case.

soldierofhell · July 27, 2019, 8:37am

Thank you @driazati, but it’s not clear what torch::save do. From API doc seems like full serialization of Module or Tensor, but the comment in example states “checkpoint” (state_dict)? What is the method of serialization, is it something similar to Tensorflow ProtoBuf? Are there aby limitations?

driazati · July 31, 2019, 7:49pm

We have many serialization formats and they’re all different and easy to mix up. We’re working on ways to fix the UX here but for now:

torch.save() in eager mode Python lets you save models so they can be loaded in Python
torch::save() in the C++ API lets you save models so they can be loaded in C++ with torch::load()
torch.jit.save() in eager mode Python lets you save models that have been compiled with TorchScript so they can be loaded in Python with torch.jit.load() and in C++ with torch::jit::load()

soldierofhell · August 1, 2019, 8:03pm

@driazati, thanks, but from your answer it’s still unclear what torch::save() saves: only “weights” or whole model (Module).
Definitely at least doc should be more precise. And to be honest this jit thing is also a little confusing, because this is kind of another world. E.g. in Tensorflow if you load .pb you end with normal tf.Graph not something like tf.pb.Graph