Hi, I am writing a script to quantize my .pth model (a universal image segmentation model) using the dynamic quantization technique described here:
https://pytorch.org/docs/stable/quantization.html
My .pth file is built on the Mask2Former architecture, which contains a MultiheadAttention module. According to the attached screenshot, this module is not supported for dynamic quantization. Can anyone confirm this, please?
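
For context, here is a minimal sketch of the kind of call involved. It does not use my actual Mask2Former checkpoint: `TinyBlock` is a hypothetical stand-in with one `nn.MultiheadAttention` and one `nn.Linear`, and the sizes are arbitrary. It assumes a CPU build of PyTorch with a quantized backend available. Passing only `nn.Linear` to `quantize_dynamic` should swap the plain linear layer and leave the attention module in floating point.

```python
import torch
import torch.nn as nn


class TinyBlock(nn.Module):
    """Hypothetical stand-in for one attention block (not Mask2Former itself)."""

    def __init__(self, dim=16, heads=2):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.ffn = nn.Linear(dim, dim)

    def forward(self, x):
        out, _ = self.attn(x, x, x)  # self-attention over the sequence
        return self.ffn(out)


model = TinyBlock().eval()

# Dynamically quantize only nn.Linear modules; a copy is returned by default.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

# Inspect which submodules were actually replaced.
for name, module in quantized.named_children():
    print(name, type(module).__module__, type(module).__name__)

# The partially quantized model should still run on a float input.
with torch.no_grad():
    y = quantized(torch.randn(1, 4, 16))
print(y.shape)
```

If this matches what others see, the open question is whether the attention weights in Mask2Former can be quantized some other way, or whether they have to stay in float.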