Latest quantization topics

Topic	Replies	Views	Activity
Additional layer in the conv weight after quantization	0	147	January 20, 2024
Question about QAT	1	163	January 19, 2024
Select the right observers in QAT	5	219	January 19, 2024
AttributeError: 'NoneType' object has no attribute 'dequantize'	10	355	January 14, 2024
Could not run 'aten::_log_softmax.out' with arguments from the 'QuantizedCPU' backend. This could be because the operator doesn't exist for this backend, or was omitted during the selective/custom build process (if using custom build)	2	238	January 13, 2024
Convert back to Unquantized model	14	1197	January 13, 2024
Extremely bad LSTM Static Quantization performance compared to Dynamic	3	355	January 12, 2024
How to convert the quantized model to tensorrt for GPU inference	9	1169	January 11, 2024
Is there a way to perform inference on the QAT model using a GPU?	1	212	January 11, 2024
NotImplementedError: Could not run 'aten::_slow_conv2d_forward' with arguments from the 'QuantizedCPU' backend	4	1913	January 11, 2024
Resnet18 fx_qat to onnx	1	181	January 11, 2024
Significant Slowdown in Inference Speed with Quantized Model in PyTorch 2.1 pt2e	5	389	January 7, 2024
How to extract the intermediate layers of vgg16 model	1	241	January 5, 2024
Unconverted GroupNorm with FX Graph Mode Quantization	9	292	December 18, 2023
BatchNorm and ConvTranspose Fusion for QAT with FX Graph Mode	3	317	December 18, 2023
Custom weight observer for powers of 2	1	359	December 15, 2023
Unchanged behaviour of using a pretrained Model for QAT	1	224	December 15, 2023
Qlinear (ONEDNN): data type of input should be QUint8	1	212	December 15, 2023
Does quantization in eager mode require inserting multiple different FFs?	1	203	December 15, 2023
In flatten the output scale is different from the input scale	1	179	December 15, 2023
ONNX export of quantized model	39	21688	December 7, 2023
Shall I remove the BN and ReLU in C progress?	1	284	December 5, 2023
Missing Histograms for LayerNorm in Numeric Suite Analysis	3	207	November 30, 2023
Quantization of a vgg16 pretrained model	4	360	November 30, 2023
Model parameters and MACs	3	217	November 24, 2023
"How to quantize the bias of a convolution in QAT (Quantization Aware Training) mode?	1	292	November 22, 2023
Does PyTorch 2.1 Support Learnable Post-Training Quantization?	1	222	November 22, 2023
ONNX export of simple quantized model fails	10	962	November 15, 2023
How to convert a model to ONNX before conversion?	0	281	November 14, 2023
The result of quantized conv2d is different from the result I calculate	4	1367	November 12, 2023