In my origin model,the upsample part is
F.interpolate(l7, scale_factor=2.0, mode='bilinear', align_corners=True)
，when i get QAT model.pt and tried it on android ,the inference time of the model.pt is slow, just similar to the float.pt
So，i changed the upsample part just like
F.interpolate(l7, scale_factor=2.0, mode='nearest')
the inference time is speed up.
But the result of segmentation model is too bad.
Why bilinear is slower than nearest after QAT?
Is there anyone can explain and give some suggestions.