What is the need for fusing layers in MobileNetV2?

I have seen the static quantization tutorial, where the layers are fused beforehand. I get good results with fused layers, but if I don't fuse the layers, my accuracy is very poor.

What is the effect of layer fusion?

Please do help me with this.


Layer fusion fuses Conv + BN into a single Conv module, or Conv + BN + ReLU into a ConvReLU module. The fusion itself does not change the numerics. Without fusion, conv, bn, and relu are quantized independently, which is likely why your accuracy drops.
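For reference, here is a minimal sketch of eager-mode fusion (the ConvBNReLU block and its module names conv/bn/relu are made up for illustration, not taken from MobileNetV2):

```python
from torch import nn
from torch.quantization import fuse_modules

class ConvBNReLU(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 16, 3, padding=1)
        self.bn = nn.BatchNorm2d(16)
        self.relu = nn.ReLU()

    def forward(self, x):
        return self.relu(self.bn(self.conv(x)))

m = ConvBNReLU().eval()  # Conv+BN fusion requires eval mode
fused = fuse_modules(m, [["conv", "bn", "relu"]])
print(fused)
# conv is now an intrinsic ConvReLU2d with the BN folded into its weights;
# bn and relu are replaced by Identity, so the forward pass is unchanged.
```

After this, only the single fused module gets an observer during `prepare`, instead of three separate ones.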

But what is the drawback of quantizing the convolution, batchnorm, and relu operations independently?

Quantizing them independently gives worse runtime performance and can also cause a larger accuracy loss: each module runs as a separate quantized op with its own scale and zero point, so the intermediate activations (e.g. the conv output before batchnorm) are quantized at every step, adding rounding error. A fused ConvReLU computes conv + folded BN + relu in one kernel and only quantizes the final output.
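To see the difference concretely, here is a rough eager-mode sketch on a toy Conv-BN-ReLU block (again not the real MobileNetV2; names, shapes, and calibration data are placeholders, and the exact converted module types can vary by PyTorch version):

```python
import torch
from torch import nn
import torch.quantization as tq

class Block(nn.Module):
    """Toy Conv-BN-ReLU block, just to compare the converted graphs."""
    def __init__(self):
        super().__init__()
        self.quant = tq.QuantStub()
        self.conv = nn.Conv2d(3, 16, 3, padding=1)
        self.bn = nn.BatchNorm2d(16)
        self.relu = nn.ReLU()
        self.dequant = tq.DeQuantStub()

    def forward(self, x):
        return self.dequant(self.relu(self.bn(self.conv(self.quant(x)))))

def static_quantize(model, fuse):
    model = model.eval()
    if fuse:
        tq.fuse_modules(model, [["conv", "bn", "relu"]], inplace=True)
    model.qconfig = tq.get_default_qconfig("fbgemm")
    tq.prepare(model, inplace=True)
    with torch.no_grad():                       # calibration (random data here)
        model(torch.randn(8, 3, 32, 32))
    tq.convert(model, inplace=True)
    return model

print(static_quantize(Block(), fuse=True))   # one fused QuantizedConvReLU2d
print(static_quantize(Block(), fuse=False))  # conv and bn converted separately,
                                             # each with its own output scale/zero_point
```

Printing both converted models shows the difference: with fusion there is a single quantized module and one set of output quantization parameters; without it, each op is quantized on its own, which is both slower and more error-prone.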
