Quantization method diff between fake quant and true quant

CapJohn · April 14, 2025, 3:33am

according to the code. The quantization formula for true quant is

int32_t res = round_half_even(value / scale) + zero_point;

and according to the document, the formula for fake quant is

int32_t res = round_half_even(value / scale + zero_point);

Why is there a difference between these two?

jerryzh168 · April 14, 2025, 7:47pm