Matrix quantization while training

I am having a bit of trouble navigating the new quantization library. I’m trying to quantize a tensor (a matrix) during training; this matrix is multiplied with a float32 convolutional layer weight. What would be the best way to go about it?

Hi @Toshi, can you provide more details? Ideally, a small code snippet would help. Have you seen the tutorials under https://pytorch.org/tutorials/#quantization-experimental?
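
To make the question concrete, here is a minimal sketch of one way such a setup might look. It is only an illustration under assumptions, not necessarily what the original poster has in mind: the module name `FakeQuantScaledConv`, the matrix shape, and the symmetric int8 range are all hypothetical. It uses `torch.fake_quantize_per_tensor_affine`, which rounds values to a quantization grid in the forward pass while letting gradients pass through in the backward pass, so the matrix can still be trained.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class FakeQuantScaledConv(nn.Module):
    """Hypothetical module: a learnable matrix is fake-quantized and then
    multiplied elementwise with a float32 convolution weight."""

    def __init__(self, in_ch=3, out_ch=8, kernel_size=3):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size, padding=1)
        # Assumed shape: one multiplier per (out_ch, in_ch) pair,
        # broadcast over the spatial kernel dimensions.
        self.matrix = nn.Parameter(torch.rand(out_ch, in_ch, 1, 1))

    def forward(self, x):
        # Symmetric per-tensor scale derived from the current value range
        # (an assumption; an observer or a learned scale could be used instead).
        scale = max(float(self.matrix.detach().abs().max()) / 127.0, 1e-8)
        # Simulate int8 quantization; gradients flow straight through.
        q_matrix = torch.fake_quantize_per_tensor_affine(
            self.matrix, scale, 0, -128, 127
        )
        # The conv weight itself stays in float32.
        weight = self.conv.weight * q_matrix
        return F.conv2d(
            x, weight, self.conv.bias,
            stride=self.conv.stride, padding=self.conv.padding,
        )


# One illustrative training step on random data.
model = FakeQuantScaledConv()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
x = torch.randn(2, 3, 8, 8)
out = model(x)
loss = out.pow(2).mean()
loss.backward()
optimizer.step()
```

If the goal is instead to quantize the whole model with the eager-mode quantization-aware training workflow, the tutorials linked above are the better starting point; this sketch only covers fake-quantizing a single tensor by hand.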