Hello there,
Since pytorch==1.3 was released, it has been possible to quantize a model during training. As I understand it, quantization works with QNNPACK.
Before that, we could also run Caffe2 models using QNNPACK.
My question is: is there a way today to quantize a Caffe2 model that has already been trained?