Different output between server and Android after FX graph mode static quantization

I tried static quantization on a DeepLabV3 image segmentation model.

The quantization I used follows this tutorial:
https://pytorch.org/tutorials/prototype/fx_graph_mode_ptq_static.html

When run on the server, the output is similar to the output before quantization, but on Android it is different.

Does anyone have an idea what causes this?

After further checking, the output values are actually the same. However, on Android the output comes back as a one-dimensional array and its elements are in a different order, so the output needs to be transposed first to get the same shape and layout as before quantization.
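To make the symptom concrete, here is a minimal sketch in plain Python (no PyTorch needed). It assumes, without this being confirmed for my model, that the flat array on Android is laid out with the channel axis varying fastest (H, W, C) while the server output is read with the channel axis varying slowest (C, H, W). The sizes and the helper names (`flatten_chw`, `flatten_hwc`, `hwc_flat_to_chw`) are made up for illustration only.

```python
# Hypothetical tiny output: 2 classes (C), 2 rows (H), 3 columns (W).
C, H, W = 2, 2, 3

# Reference values indexed as ref[c][h][w], standing in for the server output.
ref = [[[c * 100 + h * 10 + w for w in range(W)]
        for h in range(H)]
       for c in range(C)]


def flatten_chw(t):
    """Flatten with the channel axis varying slowest (C, H, W order)."""
    return [t[c][h][w] for c in range(C) for h in range(H) for w in range(W)]


def flatten_hwc(t):
    """Flatten with the channel axis varying fastest (H, W, C order)."""
    return [t[c][h][w] for h in range(H) for w in range(W) for c in range(C)]


def hwc_flat_to_chw(flat):
    """Rebuild a [C][H][W] nested list from a flat buffer laid out as H, W, C."""
    return [[[flat[(h * W + w) * C + c] for w in range(W)]
             for h in range(H)]
            for c in range(C)]


flat_hwc = flatten_hwc(ref)                        # what the 1-D array would look like
same_values = sorted(flat_hwc) == sorted(flatten_chw(ref))
same_order = flat_hwc == flatten_chw(ref)
restored = hwc_flat_to_chw(flat_hwc)               # the "transpose" step

print("same values:", same_values)
print("same order:", same_order)
print("restored matches reference:", restored == ref)
```

Under this assumption both flat arrays hold the same values and only the index arithmetic differs, which would match what I see: nothing is wrong with the numbers, but the 1-D array has to be re-indexed before it lines up with the server output.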

What I am curious about now is why the order changes, and whether it can be prevented.