At a high level it depends on the kernel/backend being used, you’d be better off asking the fbgemm or qnnpack folks. At a lower level though I believe the broad strokes are correct.
Here is a document that may be more helpful: gemmlowp/quantization.md at master · google/gemmlowp · GitHub