I have a PyTorch scripted model with fp32 datatype.
I want to measure the quantized performance on mobile with QNNPACK (i.e., take this same fp32 model but run inference in int8).
I just want to know, for the same net architecture, the performance difference between fp32 and int8.
Does PyTorch have this kind of tool, like TensorRT's `trtexec --int8` with an fp32 model?
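To be concrete, this is roughly what I have in mind (a sketch using post-training static quantization with the qnnpack engine; `TinyNet` is just a stand-in for my real network, and the calibration pass here uses random data instead of a real dataset):

```python
import torch
import torch.nn as nn
import torch.quantization as tq

# Placeholder network standing in for my real fp32 model
class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = tq.QuantStub()      # marks where fp32 -> int8 happens
        self.conv = nn.Conv2d(3, 8, 3)
        self.relu = nn.ReLU()
        self.dequant = tq.DeQuantStub()  # marks where int8 -> fp32 happens

    def forward(self, x):
        x = self.quant(x)
        x = self.relu(self.conv(x))
        return self.dequant(x)

model_fp32 = TinyNet().eval()
torch.jit.script(model_fp32).save("model_fp32.pt")

# Same architecture, but int8 weights/activations via post-training static
# quantization, targeting the qnnpack backend used on ARM phones
torch.backends.quantized.engine = "qnnpack"
model_fp32.qconfig = tq.get_default_qconfig("qnnpack")
prepared = tq.prepare(model_fp32)
prepared(torch.randn(1, 3, 224, 224))   # calibration pass (use real data)
model_int8 = tq.convert(prepared)
torch.jit.script(model_int8).save("model_int8.pt")
```

With both `model_fp32.pt` and `model_int8.pt` saved, I could benchmark each on the device, but I was hoping there is a ready-made tool for this.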
This thread might be useful for you: Speed benchmarking on android?
Please reach out to the mobile team if this script doesn't work as expected.
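In case it helps, invoking the `speed_benchmark_torch` binary from that thread on the device usually looks something like this (the model paths and input dims are placeholders for your setup, and the exact flags may vary with your build):

```
# run once per saved model; adjust input_dims to match your network
./speed_benchmark_torch --model=model_fp32.pt \
    --input_dims="1,3,224,224" --input_type=float \
    --warmup=10 --iter=50
./speed_benchmark_torch --model=model_int8.pt \
    --input_dims="1,3,224,224" --input_type=float \
    --warmup=10 --iter=50
```

Note that the int8 model still takes float input here, since the quantization of the input tensor happens inside the model.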