Additional layer in the conv weight after quantization
|
|
0
|
147
|
January 20, 2024
|
Question about QAT
|
|
1
|
163
|
January 19, 2024
|
Select the right observers in QAT
|
|
5
|
219
|
January 19, 2024
|
AttributeError: 'NoneType' object has no attribute 'dequantize'
|
|
10
|
355
|
January 14, 2024
|
Could not run 'aten::_log_softmax.out' with arguments from the 'QuantizedCPU' backend. This could be because the operator doesn't exist for this backend, or was omitted during the selective/custom build process (if using custom build)
|
|
2
|
238
|
January 13, 2024
|
Convert back to Unquantized model
|
|
14
|
1197
|
January 13, 2024
|
Extremely bad LSTM Static Quantization performance compared to Dynamic
|
|
3
|
355
|
January 12, 2024
|
How to convert the quantized model to tensorrt for GPU inference
|
|
9
|
1169
|
January 11, 2024
|
Is there a way to perform inference on the QAT model using a GPU?
|
|
1
|
212
|
January 11, 2024
|
NotImplementedError: Could not run 'aten::_slow_conv2d_forward' with arguments from the 'QuantizedCPU' backend
|
|
4
|
1913
|
January 11, 2024
|
Resnet18 fx_qat to onnx
|
|
1
|
181
|
January 11, 2024
|
Significant Slowdown in Inference Speed with Quantized Model in PyTorch 2.1 pt2e
|
|
5
|
389
|
January 7, 2024
|
How to extract the intermediate layers of vgg16 model
|
|
1
|
241
|
January 5, 2024
|
Unconverted GroupNorm with FX Graph Mode Quantization
|
|
9
|
292
|
December 18, 2023
|
BatchNorm and ConvTranspose Fusion for QAT with FX Graph Mode
|
|
3
|
317
|
December 18, 2023
|
Custom weight observer for powers of 2
|
|
1
|
359
|
December 15, 2023
|
Unchanged behaviour of using a pretrained Model for QAT
|
|
1
|
224
|
December 15, 2023
|
Qlinear (ONEDNN): data type of input should be QUint8
|
|
1
|
212
|
December 15, 2023
|
Does quantization in eager mode require inserting multiple different FFs?
|
|
1
|
203
|
December 15, 2023
|
In flatten the output scale is different from the input scale
|
|
1
|
179
|
December 15, 2023
|
ONNX export of quantized model
|
|
39
|
21688
|
December 7, 2023
|
Shall I remove the BN and ReLU in C progress?
|
|
1
|
284
|
December 5, 2023
|
Missing Histograms for LayerNorm in Numeric Suite Analysis
|
|
3
|
207
|
November 30, 2023
|
Quantization of a vgg16 pretrained model
|
|
4
|
360
|
November 30, 2023
|
Model parameters and MACs
|
|
3
|
217
|
November 24, 2023
|
"How to quantize the bias of a convolution in QAT (Quantization Aware Training) mode?
|
|
1
|
292
|
November 22, 2023
|
Does PyTorch 2.1 Support Learnable Post-Training Quantization?
|
|
1
|
222
|
November 22, 2023
|
ONNX export of simple quantized model fails
|
|
10
|
962
|
November 15, 2023
|
How to convert a model to ONNX before conversion?
|
|
0
|
281
|
November 14, 2023
|
The result of quantized conv2d is different from the result I calculate
|
|
4
|
1367
|
November 12, 2023
|