PyTorch optimization for CPU

Hey guys, we are interested in deploying our models on cloud CPUs, as they are significantly cheaper than GPUs or TPUs.

Intel used to support CPU-based deployment:

What is the current status of CPU optimization for PyTorch models?

Is there a preferred distribution (e.g. conda) to use?