About the quantization category
|
|
0
|
2149
|
October 2, 2019
|
How is quantization of activations handled in pytorch after QAT?
|
|
15
|
2067
|
March 18, 2024
|
A few questions about QConfig in quantization
|
|
1
|
37
|
March 18, 2024
|
Index out of bounds Error with PerChannel Quantization
|
|
5
|
497
|
March 18, 2024
|
Example of using quantized.Embedding?
|
|
1
|
39
|
March 18, 2024
|
FX mode static_quantization for YOLOv7
|
|
1
|
57
|
March 18, 2024
|
Graph tracing false when meeting tensor slicing operation
|
|
1
|
50
|
March 18, 2024
|
Where is torch.ops.quantized.conv2d defined?
|
|
1
|
56
|
March 18, 2024
|
Could not run 'quantized::conv2d.new' with arguments from the 'QuantizedCUDA' backend
|
|
11
|
10502
|
October 5, 2022
|
How to initialize a quantized.Embedding?
|
|
1
|
60
|
March 3, 2024
|
What is the correct way to qat a conv layer with weight norm
|
|
2
|
487
|
March 2, 2024
|
For 4bit quantization
|
|
1
|
529
|
March 1, 2024
|
Error with static quantization
|
|
0
|
67
|
March 1, 2024
|
Input image with int?
|
|
8
|
1843
|
February 29, 2024
|
Post Training Static Quantization API still uses float weights instead of int?
|
|
4
|
1150
|
February 28, 2024
|
How to apply per_tensor_symmetric activation quantization?
|
|
7
|
579
|
February 28, 2024
|
LSTM quant to get the scale and zp of hidden input
|
|
0
|
54
|
February 28, 2024
|
Quantized Resnet has no parameters()
|
|
2
|
74
|
February 27, 2024
|
TypeError: quantized_add() missing 2 required positional arguments: 'op_scale' and 'op_zero_point'
|
|
6
|
524
|
February 27, 2024
|
How does gradient calculation in quantization-aware training? Straight through estimatior?
|
|
0
|
66
|
February 27, 2024
|
Is pytorch simulating the quantization?
|
|
0
|
76
|
February 26, 2024
|
Implementing Quantized Linear Layer in Numpy
|
|
0
|
67
|
February 26, 2024
|
Quantization Bug in Concatenation of Tensor
|
|
0
|
64
|
February 22, 2024
|
Does "De-Quantization" happens/executed after every Conv/Linear layer when using "torch.quantization.quantize_dynamic"
|
|
4
|
91
|
February 22, 2024
|
QAT - how to handle nn.Parameter
|
|
5
|
178
|
February 21, 2024
|
What is the recommended way to use (PyTorch naïve quantization) when deploying to int8 TensorRT?
|
|
0
|
77
|
February 21, 2024
|
Correctly changing precision in DLRM
|
|
1
|
67
|
February 20, 2024
|
Wav2vec2 quantization dimention error
|
|
4
|
116
|
February 20, 2024
|
How to export a correct quantized model to onnx format
|
|
5
|
1044
|
February 16, 2024
|
Decrease in model parameters in dynamic quantization
|
|
3
|
146
|
February 13, 2024
|