Compare Deployment Types To Deploy A Model


After training a model, I want to deploy this model in production. But there are many kinds of types such as TorchScript, ONNX, and TorchServe.
I read about them but it is quite difficult to understand all of them.
So could you explain them easily and compare them? Which one is the best for production?

ps: I tested inference normally by a pytorch module and a torchscipt model in the python environment. The inference time of the pytorch module is faster. It’s quite hard to understand.

Help me, thanks

Does anybody help me? Please.