Permutation not working in Quantized Model

I have a model that contains a custom permutation that I need to apply to the second dimension of a Tensor. It is implemented as

def forward(self, x: torch.Tensor):
    return x[:, self.permutation]

where self.permutation is a LongTensor.
When the model is not quantized (x is a FloatTensor), everything works correctly. When I quantize the model, I get the following error:

RuntimeError: Could not run 'aten::empty.memory_format' with arguments from the 'QuantizedCPU' backend. 'aten::empty.memory_format' is only available for these backends: [CPU, CUDA, MkldnnCPU, SparseCPU, SparseCUDA, BackendSelect, Autograd, Profiler, Tracer]

It seems that the operation is not implemented. I'm using PyTorch 1.6.0.
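
For context, here is a minimal sketch that reproduces the setup (the module and shapes are hypothetical; eager-mode static quantization is assumed):

import torch
import torch.nn as nn

class Shuffle(nn.Module):
    # Hypothetical module: permutes the channel dimension (dim 1)
    # with a fixed LongTensor index.
    def __init__(self, num_channels: int):
        super().__init__()
        self.register_buffer("permutation", torch.randperm(num_channels))

    def forward(self, x: torch.Tensor):
        return x[:, self.permutation]

class Model(nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = torch.quantization.QuantStub()
        self.shuffle = Shuffle(8)
        self.dequant = torch.quantization.DeQuantStub()

    def forward(self, x):
        return self.dequant(self.shuffle(self.quant(x)))

m = Model().eval()
m.qconfig = torch.quantization.get_default_qconfig("fbgemm")
torch.quantization.prepare(m, inplace=True)
m(torch.randn(1, 8, 4, 4))   # calibration pass in float
torch.quantization.convert(m, inplace=True)
m(torch.randn(1, 8, 4, 4))   # fails: indexing is not implemented for QuantizedCPU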
Is there any alternative permutation operation that I can use?

Thanks,
Matteo


Maybe you can work around it by adding a dequantize before this module and a quantize after it, so that this part is not quantized?
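
A sketch of that workaround, assuming eager-mode quantization (the wrapper name is hypothetical); the DeQuantStub/QuantStub pair keeps the indexing in float:

import torch
import torch.nn as nn

class FloatPermute(nn.Module):
    # Hypothetical wrapper: dequantize, index in float, re-quantize.
    def __init__(self, permutation: torch.Tensor):
        super().__init__()
        self.register_buffer("permutation", permutation)
        self.dequant = torch.quantization.DeQuantStub()
        self.quant = torch.quantization.QuantStub()

    def forward(self, x: torch.Tensor):
        x = self.dequant(x)          # back to float
        x = x[:, self.permutation]   # indexing works on a FloatTensor
        return self.quant(x)         # re-quantized after prepare()/convert()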

Yes, that is a possible workaround, but because I need the permutation in many layers (I'm using a variant of ShuffleNet), I think performance will suffer.
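
If it helps, the shuffle follows the usual ShuffleNet pattern, so one alternative I could try is expressing it with reshape and transpose instead of index-based permutation (whether these ops are supported for quantized tensors in 1.6 I have not verified):

import torch

def channel_shuffle(x: torch.Tensor, groups: int) -> torch.Tensor:
    # ShuffleNet-style channel shuffle via view + transpose, avoiding
    # LongTensor indexing. Quantized-tensor support in 1.6 is unverified.
    n, c, h, w = x.shape
    x = x.view(n, groups, c // groups, h, w)
    x = x.transpose(1, 2).contiguous()
    return x.view(n, c, h, w)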

@jerryzh168 @raghuramank100 do you know if this is supported in PyTorch quantization?


I'm not sure which operator is used here; is it slice?

Please open an issue at https://github.com/pytorch/pytorch/issues and provide a repro.