With a quantized model, it's necessary to set the correct backend (`fbgemm` or `qnnpack`) for inference.
But in quantization aware training, does this backend affect the training?
For instance, can I train the quantized model using the `fbgemm` backend and then use it with `qnnpack` in the inference phase?
Hi @eefahd ,
There are a couple of things to keep in mind:
- default qconfigs have different settings for `fbgemm` and `qnnpack`. One setting in particular, `reduce_range`, if set to `False`, only works correctly in `qnnpack` and leads to potential overflow in `fbgemm`.
- when weights are packed, the global backend setting is used to determine whether to pack for `fbgemm` or for `qnnpack`.
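As a rough sketch of the workflow this implies (assuming a PyTorch build where both engines are compiled in, and using the `torch.ao.quantization` qconfig helpers), you would pick the qconfig for your training setup and then set the global engine to the target backend before packing/inference:

```python
import torch
from torch.ao.quantization import get_default_qat_qconfig

# The default QAT qconfigs differ per backend; in particular,
# reduce_range is set differently for fbgemm vs. qnnpack.
fbgemm_qconfig = get_default_qat_qconfig('fbgemm')
qnnpack_qconfig = get_default_qat_qconfig('qnnpack')
print(fbgemm_qconfig)
print(qnnpack_qconfig)

# Before convert()/inference, the global engine decides how weights
# are packed, so it must match the backend you will actually run on.
if 'qnnpack' in torch.backends.quantized.supported_engines:
    torch.backends.quantized.engine = 'qnnpack'
print(torch.backends.quantized.engine)
```

So if you plan to run inference with `qnnpack`, the safer path is to train with the `qnnpack` qconfig as well, so the fake-quant settings seen during training match what the inference backend supports.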