Latest quantization topics

Topic	Replies	Views	Activity
Quantization Bug in Concatenation of Tensor	1	147	April 2, 2024
Roadmap for torch.ao?	2	175	April 2, 2024
Variable-bit (sub 8-bits) quantization for custom hardware deployment with power-of-two (pot) scales	9	1054	April 2, 2024
Question about quint8 and qint8	1	125	March 29, 2024
Do I really need two separate model definitions for a quantized and an "unquantized" model?	2	142	March 28, 2024
QAT specific layers of a model	1	96	March 28, 2024
Quantizing model I'm hitting createStatus == pytorch_qnnp_status_success INTERNAL ASSERT FAILED	1	111	March 28, 2024
RuntimeError in torch.quantization.convert after QAT on GPU	2	114	March 28, 2024
Dequantize tensors from int8 to fp16	3	176	March 28, 2024
Run quantized model on GPU	1	224	March 25, 2024
Graph tracing false when meeting tensor slicing operation	6	226	March 25, 2024
For 4bit quantization	2	657	March 22, 2024
Quantized onnx model run slower	1	145	March 22, 2024
Implementing Quantized Linear Layer in Numpy	1	212	March 21, 2024
How to inference with smoothquant quantized model with pytorch?	6	697	March 20, 2024
A few questions about QConfig in quantization	3	199	March 19, 2024
Index out of bounds Error with PerChannel Quantization	6	617	March 19, 2024
How is quantization of activations handled in pytorch after QAT?	15	2521	March 18, 2024
Example of using quantized.Embedding?	1	138	March 18, 2024
Where is torch.ops.quantized.conv2d defined?	1	156	March 18, 2024
Could not run 'quantized::conv2d.new' with arguments from the 'QuantizedCUDA' backend	11	11496	October 5, 2022
How to initialize a quantized.Embedding?	1	134	March 3, 2024
What is the correct way to qat a conv layer with weight norm	2	559	March 2, 2024
Input image with int?	8	2193	February 29, 2024
Post Training Static Quantization API still uses float weights instead of int?	4	1313	February 28, 2024
How to apply per_tensor_symmetric activation quantization?	7	676	February 28, 2024
LSTM quant to get the scale and zp of hidden input	0	126	February 28, 2024
Quantized Resnet has no parameters()	2	179	February 27, 2024
TypeError: quantized_add() missing 2 required positional arguments: 'op_scale' and 'op_zero_point'	6	704	February 27, 2024
How does gradient calculation work in quantization-aware training? Straight-through estimator?	0	185	February 27, 2024