How do I save and load a quantized model?

Tiru_B · December 16, 2019, 9:13am

I have quantized ResNet-50. The per-channel quantized ResNet-50 model gives accuracy as good as the floating-point model. If I save it with torch.jit.save, I can load it with torch.jit.load and run inference.

How can I use torch.save and torch.load on a quantized model?
Will the entire state dict have the same scale and zero point?
How can I get each layer's scale and zero point from the quantized model?

jerryzh168 · December 18, 2019, 12:29am

How can I use torch.save and torch.load on a quantized model?

Currently I think we only support torch.save(model.state_dict()) and model.load_state_dict(…). Calling torch.save/torch.load on the model directly is not yet supported, I believe.

Will the entire state dict have the same scale and zero point?

No, each layer has its own scale/zero_point, calculated during the calibration step.

How can I get each layer's scale and zero point from the quantized model?

You can print the quantized model and it will show scale and zero_point, e.g.:

> print(torch.nn.quantized.Conv2d(3, 3, 3))
QuantizedConv2d(3, 3, kernel_size=(3, 3), stride=(1, 1), scale=1.0, zero_point=0)
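Besides printing, the values can also be read programmatically. The following is a minimal sketch, not taken from the thread: `TinyNet` is a hypothetical toy model, the calibration data is random, and the backend choice assumes either fbgemm or qnnpack is available in your build. It collects the output scale and zero_point of every converted layer and then looks at the weight's own quantization parameters, which are per-channel or per-tensor depending on the qconfig.

```python
import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical toy model, used only for illustration."""

    def __init__(self):
        super().__init__()
        self.quant = torch.quantization.QuantStub()
        self.conv = nn.Conv2d(3, 4, kernel_size=3)
        self.dequant = torch.quantization.DeQuantStub()

    def forward(self, x):
        return self.dequant(self.conv(self.quant(x)))


# Assumption: pick whichever of the two common backends this build supports.
engine = "fbgemm" if "fbgemm" in torch.backends.quantized.supported_engines else "qnnpack"
torch.backends.quantized.engine = engine

model = TinyNet().eval()
model.qconfig = torch.quantization.get_default_qconfig(engine)
prepared = torch.quantization.prepare(model)
prepared(torch.randn(8, 3, 16, 16))  # calibration pass (random data here)
qmodel = torch.quantization.convert(prepared)

# Output (activation) scale / zero_point of each quantized layer.
layer_qparams = {}
for name, module in qmodel.named_modules():
    if hasattr(module, "scale") and hasattr(module, "zero_point"):
        layer_qparams[name] = (float(module.scale), int(module.zero_point))
        print(name, layer_qparams[name])

# The weight tensor carries its own quantization parameters.
w = qmodel.conv.weight()
if w.qscheme() in (torch.per_channel_affine, torch.per_channel_symmetric):
    print("per-channel scales:", w.q_per_channel_scales())
    print("per-channel zero points:", w.q_per_channel_zero_points())
else:
    print("per-tensor scale / zero point:", w.q_scale(), w.q_zero_point())
```

With real calibration data, the values in `layer_qparams` will generally differ from layer to layer.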

Tiru_B · January 8, 2020, 5:58am

Thank you @jerryzh168

I was able to save with model.state_dict(), but I am not able to load the model with model.load_state_dict(). It gives a KeyError.

Secondly, if I save with torch.jit.save(torch.jit.script(pcqmodel), "quantization_per_channel_model.pth"), I am not able to see the quantization info after loading the model, as described in this issue:

github.com/pytorch/pytorch

How to save quantized model in PyTorch1.3 with quantization information

Opened 07:55AM - 19 Oct 19 UTC by vippeterhou · closed 05:05PM - 23 Oct 19 UTC · labels: oncall: quantization, triaged

## ❓ How to save the quantized model in PyTorch1.3 with quantization information…

Is there any way to save the quantized model in PyTorch1.3, which keeps the original information remaining? I know that I can save it after tracing it by:

```python
# Save
torch.jit.save(torch.jit.script(self.model_q), "quant_model.pth")
# Load
mq = torch.jit.load("quant_model.pth")
```

Although `mq` has the right result, it, **however**, loses the quantized information, such as module(layer) name, zero point, scale, etc.

cc @jerryzh168 @jianyuh @dzhulgakov @raghuramank100

jerryzh168 · March 3, 2020, 5:43pm

Are you using the most recent version? Could you try again with the PyTorch nightly builds?

Zafar · March 3, 2020, 10:17pm

Also, check whether it is just the __repr__ that is not showing the info or whether the quant params are really missing: try getting the scale and zero_point directly.
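One way to run that check is sketched below, under stated assumptions: `TinyNet` is a hypothetical toy model, the calibration data is random, and the file name is arbitrary. It scripts and saves a quantized model, reloads it, and compares outputs, which only match if the quantization parameters survived the round trip. Whether `scale` and `zero_point` are exposed as attributes on the reloaded ScriptModule may depend on the PyTorch version, so the sketch falls back to `None` instead of assuming they exist.

```python
import os
import tempfile

import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical toy model, used only for illustration."""

    def __init__(self):
        super().__init__()
        self.quant = torch.quantization.QuantStub()
        self.conv = nn.Conv2d(3, 4, kernel_size=3)
        self.dequant = torch.quantization.DeQuantStub()

    def forward(self, x):
        return self.dequant(self.conv(self.quant(x)))


# Assumption: pick whichever of the two common backends this build supports.
engine = "fbgemm" if "fbgemm" in torch.backends.quantized.supported_engines else "qnnpack"
torch.backends.quantized.engine = engine

model = TinyNet().eval()
model.qconfig = torch.quantization.get_default_qconfig(engine)
prepared = torch.quantization.prepare(model)
prepared(torch.randn(8, 3, 16, 16))  # calibration pass (random data here)
qmodel = torch.quantization.convert(prepared)

x = torch.randn(1, 3, 16, 16)
expected = qmodel(x)

# Round trip through TorchScript serialization.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "quant_model.pth")
    torch.jit.save(torch.jit.script(qmodel), path)
    loaded = torch.jit.load(path)

# Identical outputs mean the quantization parameters were preserved,
# even if print(loaded) does not display them.
same_output = torch.equal(loaded(x), expected)
print("outputs identical:", same_output)

# Try reading the values directly; None means the attribute is not exposed.
print("conv scale:", getattr(loaded.conv, "scale", None))
print("conv zero_point:", getattr(loaded.conv, "zero_point", None))
```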

wolffadam · March 9, 2020, 9:21am

Be sure you run the whole post-training preparation process (layer fusion, torch.quantization.prepare(), and torch.quantization.convert()) before loading the state_dict.
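The order of operations can be sketched as follows. This is an illustration under assumptions, not a confirmed fix for the KeyError reported earlier in the thread: `TinyNet` and `prepare_for_quantization` are hypothetical names, the calibration data is random, and an in-memory buffer stands in for a file on disk. The point is that the model receiving the state_dict has already been fused, prepared, and converted, so its keys match the quantized ones. Loading a quantized state_dict into the original float model is one plausible cause of a key mismatch.

```python
import io

import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical toy model, used only for illustration."""

    def __init__(self):
        super().__init__()
        self.quant = torch.quantization.QuantStub()
        self.conv = nn.Conv2d(3, 4, kernel_size=3)
        self.relu = nn.ReLU()
        self.dequant = torch.quantization.DeQuantStub()

    def forward(self, x):
        return self.dequant(self.relu(self.conv(self.quant(x))))


def prepare_for_quantization(engine):
    """Hypothetical helper: fuse layers and insert observers."""
    model = TinyNet().eval()
    torch.quantization.fuse_modules(model, [["conv", "relu"]], inplace=True)
    model.qconfig = torch.quantization.get_default_qconfig(engine)
    torch.quantization.prepare(model, inplace=True)
    return model


# Assumption: pick whichever of the two common backends this build supports.
engine = "fbgemm" if "fbgemm" in torch.backends.quantized.supported_engines else "qnnpack"
torch.backends.quantized.engine = engine

# Saving side: fuse, prepare, calibrate, convert, then save the state_dict.
source = prepare_for_quantization(engine)
source(torch.randn(8, 3, 16, 16))  # calibration pass (random data here)
torch.quantization.convert(source, inplace=True)
buffer = io.BytesIO()
torch.save(source.state_dict(), buffer)

# Loading side: rebuild the same quantized structure first. Calibration is
# skipped (PyTorch may warn about this) because load_state_dict overwrites
# the weights and quantization parameters anyway.
target = prepare_for_quantization(engine)
torch.quantization.convert(target, inplace=True)
buffer.seek(0)
target.load_state_dict(torch.load(buffer))

x = torch.randn(1, 3, 16, 16)
outputs_match = torch.equal(source(x), target(x))
print("outputs identical after reload:", outputs_match)
```

The fusion list and qconfig on the loading side have to mirror the ones used before saving; otherwise the module names in the two state_dicts will not line up.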

praneet195 · September 27, 2021, 7:27am

Has this been fixed? I’m unable to save and load quantized models even after following all the steps.

Vasiliy_Kuznetsov · September 29, 2021, 3:24pm

Has this been fixed? I’m unable to save and load quantized models even after following all the steps.

Do you have a reproducible example on a toy model?