How to do quantization for hybrid CNN+RNN(primarily GRU) pytorch model on Nvidia GPU?

Aman_230 · August 18, 2025, 12:53pm

Currently my model has FP32 i want to convert it to INT8, i have trained the model. I explored and deep dived into the pytorch documentation but haven’t able to conclude? Anyone please help me.