Given a quantized model (PTQ or QAT), I want to convert it into a model whose parameters are represented as float32, because I want to run it on the GPU. How could I achieve this?
You mean "dequantize" a model? Interesting, I don't think we support that. Why not use the original floating-point model in this case?
Thank you for the suggestion. Actually, I want to run experiments on the quantized model itself. The problem is that the quantized model does not support GPU execution, which makes the experiments quite slow.
Maybe you can take a look at the [quantization] Frequently Asked Questions to see whether something there works for you.
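In the meantime, here is a minimal sketch of the idea at the tensor level, using PyTorch's `torch.quantize_per_tensor` and `Tensor.dequantize`. Dequantizing recovers a plain float32 tensor (`x ≈ scale * (q - zero_point)`), which is the representation a GPU can work with; doing this for every quantized parameter in a model (and copying the results into a matching float model) is one possible route, not an official conversion API:

```python
import torch

# Start from a float32 tensor and quantize it (per-tensor affine, int8).
x = torch.tensor([0.5, -1.25, 3.0])
xq = torch.quantize_per_tensor(x, scale=0.1, zero_point=0, dtype=torch.qint8)

# Dequantize back to float32: values are recovered up to quantization
# error of at most scale / 2 per element.
x_dq = xq.dequantize()

print(x_dq.dtype)  # torch.float32, i.e. a regular tensor usable on GPU
```

The same `dequantize()` call works on the weights of quantized modules (e.g. the quantized tensor returned by a quantized `Linear`'s `weight()`), so a model-level conversion would loop over modules and rebuild float32 parameters this way.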
Thank you very much.