Reduction in performance of quantized BERT model

kfir_goldberg · July 31, 2020, 2:37pm

Hi,
I’m facing a similar issue when quantizing EfficientNet.
I opened a thread about it here, but I was wondering if you found a solution to your problem.