I am using FX quantization with a custom backend and custom layers. Quantization with qint8 is working well. However, when I tried to quantize the model using qint32, the layer was not quantized during the convert_fx step.
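For reference, here is a simplified sketch of the setup. The real run uses custom observers, a custom BackendConfig, and my own layers; the toy nn.Linear model, shapes, and observer choices below are only placeholders:

import torch
import torch.nn as nn
from torch.ao.quantization import (
    MinMaxObserver,
    PerChannelMinMaxObserver,
    QConfig,
    QConfigMapping,
)
from torch.ao.quantization.quantize_fx import convert_fx, prepare_fx

# Placeholder qconfig: quint8 activations, qint32 weights. The real config
# uses custom observers and a custom BackendConfig, omitted here.
qconfig_int32 = QConfig(
    activation=MinMaxObserver.with_args(dtype=torch.quint8),
    weight=PerChannelMinMaxObserver.with_args(
        dtype=torch.qint32, qscheme=torch.per_channel_symmetric
    ),
)

# Toy model standing in for the custom layer.
model = nn.Sequential(nn.Linear(16, 8)).eval()
example_inputs = (torch.randn(1, 16),)

qconfig_mapping = QConfigMapping().set_global(qconfig_int32)
prepared = prepare_fx(model, qconfig_mapping, example_inputs)
prepared(*example_inputs)  # calibration pass
quantized = convert_fx(prepared)  # with qint32 weights the layer comes back unquantized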
Upon investigation, I found that the issue is due to the absence of torch.qint32 in the torch.ao.quantization.utils.weight_is_quantized function:
def weight_is_quantized(qconfig):
    """ Given a qconfig, decide if the weight needs to be
    quantized or not
    """
    return weight_dtype(qconfig) in [
        torch.quint8,
        torch.qint8,
        torch.float16,
        torch.quint4x2,
        torch.uint8,
        torch.int8,
        torch.int16,
        # torch.qint32  <<< this is missing
    ]
Is there a specific reason why qint32 is not included as a target for weight quantization?
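In the meantime, a local workaround along these lines seems possible (a sketch only, not fully validated; note that if the FX convert pass binds weight_is_quantized by name at import time, the same patch would also have to be applied on the module that consumes it):

import torch
import torch.ao.quantization.utils as quant_utils

_orig_weight_is_quantized = quant_utils.weight_is_quantized

def _patched_weight_is_quantized(qconfig):
    # Accept qint32 weights in addition to the dtypes already on the list.
    return (
        _orig_weight_is_quantized(qconfig)
        or quant_utils.weight_dtype(qconfig) == torch.qint32
    )

quant_utils.weight_is_quantized = _patched_weight_is_quantized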