How about exporting the model into ONNX
first and then quantizing the ONNX
model by Onnxruntime.
1 Like
How about exporting the model into ONNX
first and then quantizing the ONNX
model by Onnxruntime.