Accessing accumulated quantized result

samgd · June 1, 2020, 3:51pm

Hi,

The PyTorch quantized operations are great but they return the result after it has been quantized back to 8-bit integer. Is there a simple way of accessing the accumulated result at int32? Are there other libraries, such as Caffe or a different quantization backend, that have this API?

supriyar · June 2, 2020, 3:30am

Re-quantization (conversion from int32 to int8) is performed in the quantized operator libraries - FBGEMM and QNNPACK.
Currently there is no easy way to access this value from pytorch API.