Support for quantization in int16

Hi there,

I see on the Quantized Tensors wiki page (Introducing Quantized Tensor · pytorch/pytorch Wiki · GitHub) that there are plans to support int16 quantization. However, that page was last edited in 2020.

Are there any updates on support for int16 quantization?

Thanks,
Miranda

No. If you wanted to do that, you'd normally just use bf16 or fp16 quantization instead. There are no plans to support int16 quantization in PyTorch at present.
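
As a rough sketch of what that looks like (assuming an eager-mode nn.Module and hardware with decent bf16 support; the toy model here is just an illustration), dropping weights and activations to bf16 is a one-liner:

```python
import torch
import torch.nn as nn

# Toy model standing in for whatever you want to shrink (hypothetical example).
model = nn.Sequential(nn.Linear(128, 256), nn.ReLU(), nn.Linear(256, 10))

# Cast parameters and buffers to bfloat16 (use .to(torch.float16) / .half() for fp16).
model_bf16 = model.to(torch.bfloat16)

# Inputs need the matching dtype at inference time.
x = torch.randn(4, 128, dtype=torch.bfloat16)
with torch.no_grad():
    out = model_bf16(x)
print(out.dtype)  # torch.bfloat16
```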

Is there a way to do custom quantization so I can quantize to int16?

Yeah, please check out our new flow: Quantization — PyTorch main documentation
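
If you really do need int16 specifically, here's a minimal sketch of a manual symmetric per-tensor scheme using plain tensor ops (no native int16 quantized tensor involved; the scale handling and 1e-12 floor are just illustrative choices, not an official API):

```python
import torch

def quantize_int16(x: torch.Tensor):
    """Symmetric per-tensor quantization of a float tensor to int16."""
    qmax = 2**15 - 1  # 32767
    # Avoid a zero scale for an all-zero tensor (arbitrary small floor).
    scale = x.abs().max().clamp(min=1e-12) / qmax
    q = torch.clamp(torch.round(x / scale), -qmax - 1, qmax).to(torch.int16)
    return q, scale

def dequantize_int16(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    return q.to(torch.float32) * scale

w = torch.randn(256, 128)
q, scale = quantize_int16(w)
w_hat = dequantize_int16(q, scale)
print(q.dtype, (w - w_hat).abs().max())  # torch.int16, small reconstruction error
```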

Which backend do you want to deploy to?