Assuming a setting:
- I have a model.onnx
- I want to quant this model via a new quant algrithom(i.e. 4bit)
The question: I want to use the torch.fx as IR to represent the model.onnx, has any API? or any methods?
Assuming a setting:
The question: I want to use the torch.fx as IR to represent the model.onnx, has any API? or any methods?