Btw, as this dyamo_export is taking a normal nn.Module in and outputs an ONNX model, can we make a api that enable us to put a tracedfx.Graph into it then output an ONNX model?
Since I found that PyTorch is also working on another quantization tool pt2e, so this api can be usefule that we quantize, optimize models in fx.Graph format, then export to ONNX model for serving.
I would like to know where are we on this. I have fx.graphmodule generated form pt2e after quantization, I am not able to export such a graphmodule to onnx. I am really interested in this because I got the best quantization result using pt2e compare dot eager mode.