Hi,
After training a model, I want to deploy this model in production. But there are many kinds of types such as TorchScript, ONNX, and TorchServe.
I read about them but it is quite difficult to understand all of them.
So could you explain them easily and compare them? Which one is the best for production?
ps: I tested inference normally by a pytorch lightning module and a torchscipt model in the python environment. The inference time of the pytorch_lightning module is faster. It’s quite hard to understand.
Help me, thanks