Pointers to bring quantized models to device

How about exporting the model into ONNX first and then quantizing the ONNX model by Onnxruntime.

1 Like