Topic | Replies | Views | Activity
About the quantization category | 0 | 2238 | October 2, 2019
OutOfMemoryError: CUDA out of memory in Google Colab | 1 | 23 | May 1, 2024
Pytorch Int2 training and export | 1 | 30 | May 1, 2024
FX mode static_quantization for YOLOv7 | 15 | 260 | April 27, 2024
Search and modify layer/module outputs by name | 1 | 85 | April 26, 2024
Network pruning error | 16 | 1216 | April 26, 2024
Quantization - RuntimeError: apply_dynamic is not implemented for this packed parameter type | 3 | 99 | April 26, 2024
Can an int8 model derived from pytorch's QAT training be converted directly to tensorRT? | 3 | 62 | April 26, 2024
About the int8 training question | 16 | 3188 | April 26, 2024
Fixed scale and zero point with FixedQParamsObserver | 1 | 82 | April 24, 2024
Could not run 'aten::quantize_per_tensor.tensor_qparams' with arguments from the 'QuantizedCPU' | 1 | 68 | April 24, 2024
How can we export the model quantized by `PyTorch 2 Export Quantization` to the binary file? | 2 | 68 | April 23, 2024
Histogram Calibration taking incredibly long time | 3 | 64 | April 23, 2024
Question on skipping quantization on unsupported modules | 10 | 1612 | April 3, 2024
Error during QAT training of ResNet50 | 4 | 88 | April 2, 2024
Error with static quantization | 2 | 182 | April 2, 2024
I saved the quantized weight and loaded it with the model after torch.ao.quantization.convert(). how do I print the output of each layer of the network? | 5 | 283 | April 2, 2024
Is pytorch simulating the quantization? | 1 | 164 | April 2, 2024
Quantization Bug in Concatenation of Tensor | 1 | 133 | April 2, 2024
Roadmap for torch.ao? | 2 | 136 | April 2, 2024
Variable-bit (sub 8-bits) quantization for custom hardware deployment with power-of-two (pot) scales | 9 | 1022 | April 2, 2024
Question about quint8 and qint8 | 1 | 97 | March 29, 2024
Do I really need two separate model definition for a quantized and an "unquantized" model? | 2 | 115 | March 28, 2024
QAT specific layers of a model | 1 | 82 | March 28, 2024
Quantizing model I'm hitting createStatus == pytorch_qnnp_status_success INTERNAL ASSERT FAILED | 1 | 96 | March 28, 2024
RuntimeError in torch.quantization.convert after QAT on GPU | 2 | 95 | March 28, 2024
Dequantize tensors from int8 to fp16 | 3 | 144 | March 28, 2024
Run quantized model on GPU | 1 | 146 | March 25, 2024
Graph tracing false when meeting tensor slicing operation | 6 | 205 | March 25, 2024
For 4bit quantization | 2 | 629 | March 22, 2024