Hi all, I’m fairly new to model optimization and I’ve tried ONNX PTQ methods. However, I am required to explore QAT for YOLO pytorch models and I’m not sure what to start with.
Should I use Eager Mode or FX Graph Mode Quantization?
Which of them is easier and more general to different models?
Thanks in advance!