Dear Users,
I would like to ask for pointers on how to extend nn.Linear and nn.Conv2d for post-training static quantization or quantization-aware training without rewriting a lot of code, so that the extended modules can still be used with operator fusion, etc. An example change would be to apply an affine transformation to the weights before calling the linear operation. Could someone help, please? Thanks!
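
To make the example change concrete, here is a minimal sketch of the kind of float module I have in mind. The class name AffineWeightLinear and the parameters weight_scale and weight_shift are made up for illustration and are not part of PyTorch. This only covers the unquantized forward pass; it does not yet integrate with fusion or the quantization flow, which is the part I am asking about.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AffineWeightLinear(nn.Linear):
    """Hypothetical nn.Linear variant that applies an affine
    transformation to the weight before the linear operation."""

    def __init__(self, in_features, out_features, bias=True):
        super().__init__(in_features, out_features, bias=bias)
        # Hypothetical learnable affine parameters, initialised to the
        # identity transform so the layer starts out as a plain nn.Linear.
        self.weight_scale = nn.Parameter(torch.ones(1))
        self.weight_shift = nn.Parameter(torch.zeros(1))

    def forward(self, x):
        # Affine transformation of the weight: w' = scale * w + shift
        w = self.weight_scale * self.weight + self.weight_shift
        return F.linear(x, w, self.bias)


layer = AffineWeightLinear(4, 3)
out = layer(torch.randn(2, 4))  # output has shape (2, 3)
```

The same pattern would apply to nn.Conv2d by transforming self.weight before the convolution call.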