MobileNetV2: changes to the model during quantization

Hi all, I need help in understanding some of the changes to the model during quantization.

Pre-quantization setup:

  1. ReLU6 converted to ReLU
  2. QuantStub and DeQuantStub added to the end of the model (after the classifier):

       (quant): QuantStub()
       (dequant): DeQuantStub()

After QAT and convert:

  3. Batch norm layers seem to be removed.

     First block of the model after preprocessing:

       (features): Sequential(
         (0): Conv2dNormActivation(
           (0): Conv2d(3, 32, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
           (1): BatchNorm2d(32, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
           (2): ReLU()
         )

     First block of the model after quantization:

       (features): Sequential(
         (0): Conv2dNormActivation(
           (0): QuantizedConvReLU2d(3, 32, kernel_size=(3, 3), stride=(2, 2), scale=0.030749445781111717, zero_point=0, padding=(1, 1))
           (1): Identity()
           (2): Identity()
         )
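
For context, here is a minimal sketch of the eager-mode QAT flow that produces this kind of output. It assumes torchvision's quantization-ready mobilenet_v2 builder and a reasonably recent torch/torchvision (weights="DEFAULT" and fuse_model(is_qat=True) are from newer releases); the fine-tuning loop is left out, so treat it as a sketch rather than the exact script:

    from torch.ao.quantization import get_default_qat_qconfig, prepare_qat, convert
    from torchvision.models.quantization import mobilenet_v2

    # The quantization-ready builder already swaps ReLU6 for ReLU and registers
    # QuantStub/DeQuantStub after the backbone, which is why the stubs print at
    # the end of the module list even though forward() applies quant() first
    # and dequant() last.
    model = mobilenet_v2(weights="DEFAULT", quantize=False)
    model.train()

    # Fuse Conv + BN + ReLU triples into ConvBnReLU2d (BN is kept around during
    # QAT so its statistics keep updating).
    model.fuse_model(is_qat=True)

    # Attach fake-quant observers and fine-tune with them in place.
    model.qconfig = get_default_qat_qconfig("fbgemm")
    prepare_qat(model, inplace=True)

    # ... QAT fine-tuning loop goes here ...

    # convert() folds BN into the conv weights and swaps in QuantizedConvReLU2d,
    # leaving Identity() placeholders where BN and ReLU used to be.
    model.eval()
    quantized_model = convert(model, inplace=False)
    print(quantized_model.features[0])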

What would the reasoning be behind these changes?

For the removal of the batch norm layers: if this is post-training quantization, one reason could be that at inference time batch norm reduces to a fixed per-channel scale and shift, which can be folded into the quantized conv layer, so keeping it as a separate scale-and-shift module would be redundant.
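
To make the folding concrete, here is a minimal sketch with toy layer sizes that does the fold by hand (PyTorch's fuser does the equivalent bookkeeping internally): an eval-mode Conv + BatchNorm pair matches a single conv whose weights are rescaled by gamma / sqrt(var + eps) and whose bias absorbs the shift.

    import torch
    from torch import nn

    torch.manual_seed(0)

    # Toy Conv + BN pair in eval mode, where BN is just a fixed per-channel
    # scale-and-shift computed from its running statistics.
    conv = nn.Conv2d(3, 8, kernel_size=3, bias=False).eval()
    bn = nn.BatchNorm2d(8).eval()
    bn.running_mean.uniform_(-1.0, 1.0)
    bn.running_var.uniform_(0.5, 2.0)
    with torch.no_grad():
        bn.weight.uniform_(0.5, 1.5)
        bn.bias.uniform_(-1.0, 1.0)

    # Fold BN into the conv:
    #   W' = W * gamma / sqrt(var + eps)
    #   b' = beta - gamma * mean / sqrt(var + eps)   (the conv bias is zero here)
    scale = bn.weight / torch.sqrt(bn.running_var + bn.eps)
    fused = nn.Conv2d(3, 8, kernel_size=3, bias=True).eval()
    with torch.no_grad():
        fused.weight.copy_(conv.weight * scale.reshape(-1, 1, 1, 1))
        fused.bias.copy_(bn.bias - bn.running_mean * scale)

    x = torch.randn(1, 3, 16, 16)
    # True: the separate BN layer adds nothing once the fold is applied.
    print(torch.allclose(bn(conv(x)), fused(x), atol=1e-5))

That matches the printout above: after convert, the fold lands inside the single QuantizedConvReLU2d, and Identity() modules are left in the slots that BatchNorm2d and ReLU used to occupy.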

That’s what I thought too! Any ideas on why ReLU6 gets converted to ReLU?