Topic | Replies | Views | Date
Quantization Bug in Concatenation of Tensor | 1 | 147 | April 2, 2024
Roadmap for torch.ao? | 2 | 175 | April 2, 2024
Variable-bit (sub 8-bits) quantization for custom hardware deployment with power-of-two (pot) scales | 9 | 1054 | April 2, 2024
Question about quint8 and qint8 | 1 | 125 | March 29, 2024
Do I really need two separate model definition for a quantized and an "unquantized" model? | 2 | 142 | March 28, 2024
QAT specific layers of a model | 1 | 96 | March 28, 2024
Quantizing model I'm hitting createStatus == pytorch_qnnp_status_success INTERNAL ASSERT FAILED | 1 | 111 | March 28, 2024
RuntimeError in torch.quantization.convert after QAT on GPU | 2 | 114 | March 28, 2024
Dequantize tensors from int8 to fp16 | 3 | 176 | March 28, 2024
Run quantized model on GPU | 1 | 224 | March 25, 2024
Graph tracing false when meeting tensor slicing operation | 6 | 226 | March 25, 2024
For 4bit quantization | 2 | 657 | March 22, 2024
Quantized onnx model run slower | 1 | 145 | March 22, 2024
Implementing Quantized Linear Layer in Numpy | 1 | 212 | March 21, 2024
How to inference with smoothquant quantized model with pytorch? | 6 | 697 | March 20, 2024
A few questions about QConfig in quantization | 3 | 199 | March 19, 2024
Index out of bounds Error with PerChannel Quantization | 6 | 617 | March 19, 2024
How is quantization of activations handled in pytorch after QAT? | 15 | 2521 | March 18, 2024
Example of using quantized.Embedding? | 1 | 138 | March 18, 2024
Where is torch.ops.quantized.conv2d defined? | 1 | 156 | March 18, 2024
Could not run 'quantized::conv2d.new' with arguments from the 'QuantizedCUDA' backend | 11 | 11496 | October 5, 2022
How to initialize a quantized.Embedding? | 1 | 134 | March 3, 2024
What is the correct way to qat a conv layer with weight norm | 2 | 559 | March 2, 2024
Input image with int? | 8 | 2193 | February 29, 2024
Post Training Static Quantization API still uses float weights instead of int? | 4 | 1313 | February 28, 2024
How to apply per_tensor_symmetric activation quantization? | 7 | 676 | February 28, 2024
LSTM quant to get the scale and zp of hidden input | 0 | 126 | February 28, 2024
Quantized Resnet has no parameters() | 2 | 179 | February 27, 2024
TypeError: quantized_add() missing 2 required positional arguments: 'op_scale' and 'op_zero_point' | 6 | 704 | February 27, 2024
How does gradient calculation in quantization-aware training? Straight through estimatior? | 0 | 185 | February 27, 2024