I’ve read the quantization doc here: (beta) Static Quantization with Eager Mode in PyTorch — PyTorch Tutorials 2.2.1+cu121 documentation
but I couldn’t find an example for an embedding model to work with quantized.Embedding. Is it documented somewhere?
Simply replace the embedding module, lead to error of empty parameter list error.
Thanks!