At the beginning, the tutorial says: "Note that quantization is currently only supported for CPUs, so we will not be utilizing GPUs / CUDA in this tutorial."

QAT training does support GPUs; I think that line means quantized *inference* is not supported on GPU. We have been working on supporting GPU inference through TensorRT and cuDNN, but haven't officially released it yet.
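To make the split concrete, here is a minimal sketch of that workflow using the eager-mode `torch.quantization` API: fake-quant training runs on CUDA, then the model is moved to CPU before `convert` produces the real int8 kernels. `ToyModel` and the two-step loop are illustrative placeholders, not the tutorial's model.

```python
import torch
import torch.nn as nn

# Toy model with quant/dequant stubs, just to show the device handoff.
class ToyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = torch.quantization.QuantStub()
        self.conv = nn.Conv2d(3, 8, 3)
        self.relu = nn.ReLU()
        self.dequant = torch.quantization.DeQuantStub()

    def forward(self, x):
        x = self.quant(x)
        x = self.relu(self.conv(x))
        return self.dequant(x)

model = ToyModel().train()
model.qconfig = torch.quantization.get_default_qat_qconfig("fbgemm")
torch.quantization.prepare_qat(model, inplace=True)

# QAT itself runs fine on GPU: fake-quant ops are simulated in float.
device = "cuda" if torch.cuda.is_available() else "cpu"
model.to(device)
for _ in range(2):  # stand-in for a real training loop
    x = torch.randn(4, 3, 32, 32, device=device)
    model(x).sum().backward()

# Conversion to true int8 kernels, and quantized inference, are CPU-only.
model.to("cpu").eval()
quantized = torch.quantization.convert(model)
out = quantized(torch.randn(1, 3, 32, 32))
```

So the CPU restriction only bites after `convert`; everything before that point can stay on CUDA.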