Different output between server and android after using static quantization fx

BillyGun27 · March 8, 2023, 7:29am

I tried static quantization on deeplabv3 image segmentation model

the quantization that I use refer to this method
https://pytorch.org/tutorials/prototype/fx_graph_mode_ptq_static.html

the output is similar to before quantization when performed on server, but on android it is defferent.

Does any one idea what cause this?

BillyGun27 · March 13, 2023, 9:22am

After further checking, the output is actually the same, but since in android the output is only one dimension, the order is incorrect and output need to be transposed first to get the correct dimension shape as before quantized.

what make me curious now why the order change? and can it be prevented?