I’m confused about the meaning of “performance” in this context. Does it refer to the model’s overall accuracy, or specifically its processing speed?
I noticed that calling tensor.contiguous()
after using permute
eliminated a warning message, but unfortunately, it also caused the model to run significantly slower. What’s the explanation for this trade-off?