|
I saved the quantized weights and loaded them into the model after torch.ao.quantization.convert(). How do I print the output of each layer of the network?
|
|
5
|
518
|
April 2, 2024
|
|
Is PyTorch simulating the quantization?
|
|
1
|
375
|
April 2, 2024
|
|
Quantization Bug in Concatenation of Tensor
|
|
1
|
430
|
April 2, 2024
|
|
Roadmap for torch.ao?
|
|
2
|
516
|
April 2, 2024
|
|
Question about quint8 and qint8
|
|
1
|
846
|
March 29, 2024
|
|
Do I really need two separate model definitions for a quantized and an "unquantized" model?
|
|
2
|
334
|
March 28, 2024
|
|
QAT specific layers of a model
|
|
1
|
246
|
March 28, 2024
|
|
Quantizing a model hits createStatus == pytorch_qnnp_status_success INTERNAL ASSERT FAILED
|
|
1
|
281
|
March 28, 2024
|
|
RuntimeError in torch.quantization.convert after QAT on GPU
|
|
2
|
455
|
March 28, 2024
|
|
Dequantize tensors from int8 to fp16
|
|
3
|
929
|
March 28, 2024
|
|
Graph tracing fails when encountering a tensor slicing operation
|
|
6
|
651
|
March 25, 2024
|
|
For 4bit quantization
|
|
2
|
833
|
March 22, 2024
|
|
Quantized ONNX model runs slower
|
|
1
|
728
|
March 22, 2024
|
|
How to run inference with a SmoothQuant-quantized model in PyTorch?
|
|
6
|
1576
|
March 20, 2024
|
|
A few questions about QConfig in quantization
|
|
3
|
591
|
March 19, 2024
|
|
Index out of bounds Error with PerChannel Quantization
|
|
6
|
1062
|
March 19, 2024
|
|
How is quantization of activations handled in PyTorch after QAT?
|
|
15
|
4391
|
March 18, 2024
|
|
Example of using quantized.Embedding?
|
|
1
|
419
|
March 18, 2024
|
|
Where is torch.ops.quantized.conv2d defined?
|
|
1
|
734
|
March 18, 2024
|
|
Could not run 'quantized::conv2d.new' with arguments from the 'QuantizedCUDA' backend
|
|
11
|
16100
|
October 5, 2022
|
|
How to initialize a quantized.Embedding?
|
|
1
|
290
|
March 3, 2024
|
|
What is the correct way to apply QAT to a conv layer with weight norm?
|
|
2
|
709
|
March 2, 2024
|
|
Input image with int?
|
|
8
|
2978
|
February 29, 2024
|
|
How to apply per_tensor_symmetric activation quantization?
|
|
7
|
1016
|
February 28, 2024
|
|
LSTM quant to get the scale and zp of hidden input
|
|
0
|
301
|
February 28, 2024
|
|
Quantized Resnet has no parameters()
|
|
2
|
376
|
February 27, 2024
|
|
How does gradient calculation work in quantization-aware training? Straight-through estimator?
|
|
0
|
448
|
February 27, 2024
|
|
Is "De-Quantization" executed after every Conv/Linear layer when using "torch.quantization.quantize_dynamic"?
|
|
4
|
853
|
February 22, 2024
|
|
QAT - how to handle nn.Parameter
|
|
5
|
770
|
February 21, 2024
|
|
What is the recommended way to use (PyTorch naïve quantization) when deploying to int8 TensorRT?
|
|
0
|
328
|
February 21, 2024
|