Pointers to bring quantized models to device

111357 · March 14, 2023, 11:34am

How about exporting the model to ONNX first and then quantizing the ONNX model with ONNX Runtime?
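
A rough sketch of what that could look like, assuming the model is a PyTorch `nn.Module` and that dynamic (weight-only) INT8 quantization is acceptable for your device. The helper name `export_and_quantize`, the file names, the tensor names `input`/`output`, and the opset version are placeholders I picked for illustration, not something from your setup. It needs the `torch`, `onnx`, and `onnxruntime` packages installed.

```python
import os


def export_and_quantize(model, example_input, out_dir):
    """Export a PyTorch model to ONNX, then quantize it with ONNX Runtime.

    Hypothetical helper: names and paths are illustrative assumptions.
    """
    # Imported lazily so the heavy dependencies are only needed when called.
    import torch
    from onnxruntime.quantization import QuantType, quantize_dynamic

    fp32_path = os.path.join(out_dir, "model_fp32.onnx")
    int8_path = os.path.join(out_dir, "model_int8.onnx")

    # Step 1: export the float model to ONNX.
    model.eval()
    torch.onnx.export(
        model,
        (example_input,),
        fp32_path,
        input_names=["input"],    # assumed tensor name
        output_names=["output"],  # assumed tensor name
        opset_version=13,         # assumed; pick what your runtime supports
    )

    # Step 2: quantize the exported ONNX file (weights stored as INT8).
    quantize_dynamic(fp32_path, int8_path, weight_type=QuantType.QInt8)
    return fp32_path, int8_path


if __name__ == "__main__":
    import tempfile

    import torch

    # Toy model standing in for the real one.
    toy = torch.nn.Sequential(
        torch.nn.Linear(16, 32),
        torch.nn.ReLU(),
        torch.nn.Linear(32, 4),
    )
    with tempfile.TemporaryDirectory() as tmp:
        print(export_and_quantize(toy, torch.randn(1, 16), tmp))
```

If you need static quantization instead, ONNX Runtime also has `quantize_static`, which requires a calibration data reader. Whether the quantized operators actually run faster on your device depends on the execution provider you use there, so it is worth checking the resulting model on the target hardware.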