What are the recommended ways to deploy a PyTorch model to a desktop machine (with 1080 Ti GPUs) for fast inference?
Related to this, does NVIDIA TensorRT speed up PyTorch model inference on 1080 Ti GPUs? If so, are there any benchmarks showing by how much for typical deep learning models?