Hello,
I am trying to profile quantized models using torch.profiler APIs.
Are outputs of those APIs ( CPU_memory_usage, cpu_time, …) accurate?
(from what I understood torch.profiler is made for nn.Module type)
If it is not the case, are there other methods to profile quantized models?
Thank you,