How to avoid Quantization warning: "Must run observer before calling calculate_qparams."?

FabianSchuetze · June 18, 2020, 10:56am

I am working through the PyTorch tutorial for static quantization, and when I run the line:

torch.quantization.convert(per_channel_quantized_model, inplace=True)

I receive the following warning:

.../torch/quantization/observer.py:845: 
UserWarning: must run observer before calling calculate_qparams. Returning default scale and zero point

I call the convert function within the following lines of code:

per_channel_quantized_model = load_model(..)
per_channel_quantized_model.eval()
per_channel_quantized_model.fuse_model()
per_channel_quantized_model.qconfig = torch.quantization.get_default_qconfig('fbgemm')
print(per_channel_quantized_model.qconfig)
torch.quantization.prepare(per_channel_quantized_model, inplace=True)
evaluate(per_channel_quantized_model, ...)
torch.quantization.convert(per_channel_quantized_model, inplace=True)

Does anybody know what the warning means and how I can avoid it? I appreciate any hints and suggestions!
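
As a rough illustration of what the warning refers to, here is a pure-Python toy observer. It is a simplified sketch, not PyTorch's implementation: the class name ToyMinMaxObserver and its observe method are hypothetical, and it assumes 8-bit affine quantization to the range [0, 255]. It only shows the idea that quantization parameters are derived from observed min/max values, and that a fallback scale of 1.0 and zero point of 0 is returned when nothing was observed.

```python
import warnings


class ToyMinMaxObserver:
    """Simplified stand-in for a min/max observer (not PyTorch's code)."""

    def __init__(self, qmin=0, qmax=255):
        self.qmin, self.qmax = qmin, qmax
        self.min_val = None  # nothing observed yet
        self.max_val = None

    def observe(self, values):
        """Track the running min/max of values seen during calibration."""
        lo, hi = min(values), max(values)
        self.min_val = lo if self.min_val is None else min(self.min_val, lo)
        self.max_val = hi if self.max_val is None else max(self.max_val, hi)

    def calculate_qparams(self):
        """Return (scale, zero_point) for affine quantization to [qmin, qmax]."""
        if self.min_val is None:
            warnings.warn("must run observer before calling calculate_qparams. "
                          "Returning default scale and zero point")
            return 1.0, 0
        # The representable range is widened so that it always contains 0.0.
        lo, hi = min(self.min_val, 0.0), max(self.max_val, 0.0)
        # Fall back to scale 1.0 if every observed value was exactly zero.
        scale = (hi - lo) / (self.qmax - self.qmin) or 1.0
        zero_point = self.qmin - round(lo / scale)
        return scale, max(self.qmin, min(self.qmax, zero_point))


# An observer that never saw data falls back to defaults and warns.
unused = ToyMinMaxObserver()
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    default_qparams = unused.calculate_qparams()

# An observer that saw calibration data derives its parameters from it.
calibrated = ToyMinMaxObserver()
calibrated.observe([-1.0, 0.5, 3.0])
calibrated_qparams = calibrated.calculate_qparams()
```

In this toy version, the fallback parameters have nothing to do with the real activation range, which is why a layer quantized with them could lose accuracy.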

khizar-anjum · June 18, 2020, 3:08pm

I'm facing the same issue. torch.quantization.convert is supposed to run the observers, right? This warning does not make sense.

supriyar · June 19, 2020, 9:11pm

The prepare step inserts the observers. After that, every forward pass of the model also runs the observers.
If you call convert without first calling prepare and running the model, it can complain that the observers were not run.
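
The order described above can be sketched on a small example. This is a minimal sketch, assuming a PyTorch build that still ships the eager-mode torch.quantization API and an x86 CPU for the 'fbgemm' backend. TinyNet and the random calibration data are hypothetical stand-ins, not the tutorial's model.

```python
import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical model, only here to illustrate the workflow."""

    def __init__(self):
        super().__init__()
        self.quant = torch.quantization.QuantStub()      # float -> quantized
        self.conv = nn.Conv2d(3, 4, kernel_size=3)
        self.relu = nn.ReLU()
        self.dequant = torch.quantization.DeQuantStub()  # quantized -> float

    def forward(self, x):
        return self.dequant(self.relu(self.conv(self.quant(x))))


torch.manual_seed(0)
model = TinyNet().eval()
model.qconfig = torch.quantization.get_default_qconfig("fbgemm")

# Step 1: prepare attaches observer modules; nothing has been observed yet.
torch.quantization.prepare(model, inplace=True)
observer_names = [name for name, _ in model.named_modules()
                  if name.endswith("activation_post_process")]

# Step 2: calibration. Each forward pass feeds activations to the observers.
with torch.no_grad():
    for _ in range(4):
        model(torch.randn(2, 3, 8, 8))

# Step 3: convert reads the observed statistics and swaps in quantized modules.
torch.quantization.convert(model, inplace=True)
with torch.no_grad():
    output = model(torch.randn(1, 3, 8, 8))
```

Printing observer_names after step 1 is a quick way to see which submodules received an observer. Any of them that the calibration forward pass never reaches would still be unobserved when convert runs.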

Which model are you running this on? We can take a look if there is a repro.

FabianSchuetze · June 20, 2020, 7:41am

Thanks for your replies, @khizar-anjum and @supriyar!

After @khizar-anjum's comment, I also filed an issue on GitHub. The warning is thrown when running the static quantization tutorial. I also received the warning with an SSD-type model I wrote. The quantization led to low accuracy, and I began asking myself whether it was caused by the improper quantization the observer warns about.

blueskywwc · July 6, 2020, 6:59am

Hello, I also encountered this problem. Is there a solution yet? Thank you!

FabianSchuetze · July 6, 2020, 3:05pm

See here for a solution.

blueskywwc · July 7, 2020, 8:56am

I replaced self.skip_add.add with torch.add:

class InvertedResidual(nn.Module):
    def __init__(self, in_channel, out_channel, stride, expand_ratio):
        super(InvertedResidual, self).__init__()
        hidden_channel = int(round(in_channel * expand_ratio))
        self.shortcut = stride == 1 and in_channel == out_channel

        layers = []
        if expand_ratio != 1:
            # 1x1 pointwise conv
            layers.append(ConvBnRelu(in_channel, hidden_channel, kersize=1))
        layers.extend([
            # 3x3 depthwise conv
            ConvBnRelu(hidden_channel, hidden_channel, stride=stride, groups=hidden_channel),
            nn.Conv2d(hidden_channel, out_channel, kernel_size=1, bias=False),
            nn.BatchNorm2d(out_channel),
        ])

        self.conv = nn.Sequential(*layers)
        # self.skip_add = nn.quantized.FloatFunctional()

    def forward(self, x):
        if self.shortcut:
            # return self.skip_add.add(x, self.conv(x))
            return torch.add(x, self.conv(x))
        else:
            return self.conv(x)

RuntimeError: Could not run 'aten::add.Tensor' with arguments from the 'QuantizedCPUTensorId' backend. 'aten::add.Tensor' is only available for these backends: [CPUTensorId, MkldnnCPUTensorId, SparseCPUTensorId, VariableTensorId].

What did I do wrong? Thanks!
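
The RuntimeError above is consistent with torch.add receiving quantized tensors: after convert, the activations between the quant and dequant stubs are quantized, and the message says the plain add kernel is not available for that backend. The following is a minimal sketch, not the poster's model, of a skip connection that keeps nn.quantized.FloatFunctional and calls it in forward. ResidualBlock and the random calibration inputs are hypothetical, and the sketch assumes the eager-mode torch.quantization API on an x86 CPU.

```python
import torch
import torch.nn as nn


class ResidualBlock(nn.Module):
    """Hypothetical block with a quantization-friendly skip connection."""

    def __init__(self, channels: int = 4):
        super().__init__()
        self.quant = torch.quantization.QuantStub()
        self.conv = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        # Wrapper for the addition; prepare gives it an observer and
        # convert replaces it with a quantized counterpart.
        self.skip_add = nn.quantized.FloatFunctional()
        self.dequant = torch.quantization.DeQuantStub()

    def forward(self, x):
        x = self.quant(x)
        out = self.skip_add.add(x, self.conv(x))  # instead of torch.add
        return self.dequant(out)


torch.manual_seed(0)
model = ResidualBlock().eval()
model.qconfig = torch.quantization.get_default_qconfig("fbgemm")
torch.quantization.prepare(model, inplace=True)

# Calibration: the forward pass runs skip_add.add, so its observer sees data.
with torch.no_grad():
    for _ in range(4):
        model(torch.randn(2, 4, 8, 8))

torch.quantization.convert(model, inplace=True)
with torch.no_grad():
    y = model(torch.randn(1, 4, 8, 8))
```

One plausible source of the original warning, not confirmed in this thread, is a FloatFunctional that is created in __init__ but never called in forward (for example, in blocks without a shortcut): its observer would never see data. Creating it only in blocks that use the shortcut may avoid that, though this is an assumption.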

Ashar_Ali · September 21, 2020, 7:42pm

I am still facing the same issue even after following the instructions in the link you shared. Are there any further updates on this?