How to do quantization for hybrid CNN+RNN(primarily GRU) pytorch model on Nvidia GPU?

Currently my model has FP32 i want to convert it to INT8, i have trained the model. I explored and deep dived into the pytorch documentation but haven’t able to conclude? Anyone please help me.