|
Quantization of a vgg16 pretrained model
|
|
4
|
837
|
November 30, 2023
|
|
Model parameters and MACs
|
|
3
|
487
|
November 24, 2023
|
|
"How to quantize the bias of a convolution in QAT (Quantization Aware Training) mode?
|
|
1
|
627
|
November 22, 2023
|
|
Does PyTorch 2.1 Support Learnable Post-Training Quantization?
|
|
1
|
321
|
November 22, 2023
|
|
ONNX export of simple quantized model fails
|
|
10
|
2742
|
November 15, 2023
|
|
How to convert a model to ONNX before conversion?
|
|
0
|
439
|
November 14, 2023
|
|
The result of quantized conv2d is different from the result I calculate
|
|
4
|
1901
|
November 12, 2023
|
|
Unable to convert a quantized model to TorchScript
|
|
2
|
729
|
November 10, 2023
|
|
List of operations supported by QuantizedCPU backend
|
|
2
|
852
|
November 10, 2023
|
|
Is it possible to do int16 qat?
|
|
1
|
1236
|
November 10, 2023
|
|
Is the Static Quantization(PTSQ) model performs Integer-Arithmetic-Only Inference?
|
|
1
|
559
|
November 10, 2023
|
|
How does quantized conv2d handle scale and zero_point?
|
|
5
|
2255
|
November 9, 2023
|
|
Accuracy drop after model quantization
|
|
3
|
778
|
November 4, 2023
|
|
What's the supported datatype for activation in torch.ao.nn.quantized.linear?
|
|
2
|
507
|
November 4, 2023
|
|
Accuracy drop after prepare_qat_fx with no quantization
|
|
2
|
445
|
November 1, 2023
|
|
Why does an unsigned torch.quint8 tensor have a sign?
|
|
6
|
1045
|
November 1, 2023
|
|
Quantization of ssdlite from torchvision
|
|
1
|
455
|
October 26, 2023
|
|
RuntimeError: Unsupported qscheme: per_channel_affine during fx qat
|
|
8
|
2386
|
October 17, 2023
|
|
Static quantization for Transformer block : AttributeError 'function' object has no attribute 'is_cuda'
|
|
5
|
879
|
October 13, 2023
|
|
Quantization official example
|
|
11
|
1083
|
October 13, 2023
|
|
Quantization of a single tensor
|
|
3
|
1171
|
October 12, 2023
|
|
Custom Quantization using PT2 -> q/dq representation
|
|
1
|
493
|
October 12, 2023
|
|
Quantization of a pytorch model
|
|
1
|
440
|
October 12, 2023
|
|
Mixed Precision Training and Quantisation Aware Training together
|
|
1
|
749
|
October 12, 2023
|
|
How to let the two input nodes of `add` op share the same quantization params(scale and zero point)?
|
|
2
|
536
|
October 9, 2023
|
|
Quantization of multi_head_attention_forward
|
|
11
|
2894
|
October 4, 2023
|
|
Error trying to quantize Transformer model
|
|
2
|
738
|
October 3, 2023
|
|
How to train non quantized layers of quantized model on GPU
|
|
1
|
455
|
October 3, 2023
|
|
Torchscript with dynamic quantization produces inconsistent model outputs in Python and Java
|
|
9
|
1079
|
October 3, 2023
|
|
Is bias quantized while doing pytorch static quantization?
|
|
18
|
5537
|
September 27, 2023
|