@mohit7, it might be useful to take a look at the quantized models uploaded in torchvision.
Here is a link to the resnet model - https://github.com/pytorch/vision/blob/master/torchvision/models/quantization/resnet.py
I think that if you follow the same flow for your model and re-implement the fuse_model(self) method for your own layers, it should work.
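
To make that concrete, here is a rough sketch of what a fuse_model method could look like. SmallNet and its layer names (conv1, bn1, relu1, conv2, bn2) are made up for illustration and are not taken from your model or from torchvision; the only library call it relies on is torch.quantization.fuse_modules, and it assumes post-training quantization, so the model is fused in eval mode.

```python
import torch
import torch.nn as nn
from torch.quantization import fuse_modules


class SmallNet(nn.Module):
    """Hypothetical toy model, only here to show the fuse_model pattern."""

    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(3, 8, kernel_size=3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(8)
        self.relu1 = nn.ReLU(inplace=False)
        self.conv2 = nn.Conv2d(8, 8, kernel_size=3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(8)

    def forward(self, x):
        x = self.relu1(self.bn1(self.conv1(x)))
        return self.bn2(self.conv2(x))

    def fuse_model(self):
        # Each inner list names submodules to fuse, in the order they run.
        # Supported patterns include conv+bn and conv+bn+relu.
        fuse_modules(
            self,
            [["conv1", "bn1", "relu1"], ["conv2", "bn2"]],
            inplace=True,
        )


model = SmallNet().eval()  # eval mode: batch norm is folded into the conv weights
x = torch.randn(1, 3, 16, 16)

with torch.no_grad():
    before = model(x)

model.fuse_model()

with torch.no_grad():
    after = model(x)

# The fused model should give (numerically almost) the same output.
print(torch.allclose(before, after, atol=1e-4))
print(model)
```

After fusing, the first module of each group holds the fused op and the remaining ones are replaced by nn.Identity, so forward() does not need to change. For your model you would list your own submodule names in the same way, and for nested blocks call fuse_model on each child block like the torchvision resnet does.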