I am trying to use quantization-aware training (QAT) to get better performance from my quantized EfficientNet model, since post-training quantization (PTQ) performed very poorly. I first trained it with the same hyperparameters used for the original (non-quantized) model, and it did not perform well. After decreasing the learning rate, I got much better performance, but it is still not very close to that of the original model.
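To make the learning-rate change concrete, here is a minimal sketch of the kind of reduced schedule meant above. It is only an illustration: the function name `qat_learning_rate`, the `scale=0.01` reduction factor, the optional warmup, and the cosine decay are all assumptions for the example, not the exact configuration used.

```python
import math


def qat_learning_rate(step, total_steps, base_lr, scale=0.01, warmup_steps=0):
    """Learning rate for a QAT fine-tuning step (hypothetical helper).

    base_lr      -- peak learning rate of the original float training run
    scale        -- assumed reduction factor applied for QAT (placeholder value)
    warmup_steps -- optional linear warmup before the cosine decay starts
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")

    # QAT starts from already-trained weights, so the peak rate is reduced.
    peak_lr = base_lr * scale

    # Linear warmup from peak_lr / warmup_steps up to peak_lr.
    if warmup_steps > 0 and step < warmup_steps:
        return peak_lr * (step + 1) / warmup_steps

    # Cosine decay from peak_lr down to 0 over the remaining steps.
    decay_steps = max(1, total_steps - warmup_steps)
    progress = min(max((step - warmup_steps) / decay_steps, 0.0), 1.0)
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Print the schedule at a few points of a 1000-step fine-tuning run.
    for s in (0, 250, 500, 750, 1000):
        print(s, qat_learning_rate(s, total_steps=1000, base_lr=0.1))
```

The value returned for each step would be passed to whatever optimizer the training loop uses; only the scale factor and schedule shape are being varied here.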
Can anyone offer guidance on hyperparameter tuning for QAT, or any other tips and tricks that could be useful? I was not able to find any good resources on this, so pointers to those would also be helpful.