Different implementations of the square function

WeiQin_Chuah · June 20, 2023, 5:29am

I am writing a custom convolution function that involves squaring some inputs (x). I am wondering how the different ways of computing a square differ:

torch.square(x)
torch.pow(x, 2)
x**2
x*x

I would like to know how PyTorch handles each of these operations and whether there is any difference in terms of gradient computation and efficiency.

Thank you

ptrblck · June 20, 2023, 8:20am

You can use any of these as they should all dispatch to the same kernel.
A similar question was also asked here.
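
If you want to check this yourself, here is a minimal sketch that compares the four forms numerically. It only verifies that the forward values and the gradients agree; it does not show which kernel is dispatched. The helper name grad_of, the input size, and the float64 dtype are choices made for this illustration and are not from the thread.

```python
import torch

def grad_of(square_fn, x):
    # Hypothetical helper: gradient of sum(square_fn(x)) with respect to x.
    # A fresh leaf tensor is used so gradients do not accumulate across calls.
    x = x.clone().requires_grad_(True)
    square_fn(x).sum().backward()
    return x.grad

x = torch.randn(1000, dtype=torch.float64)

# The four ways of squaring listed in the question.
variants = {
    "torch.square(x)": torch.square,
    "torch.pow(x, 2)": lambda t: torch.pow(t, 2),
    "x ** 2": lambda t: t ** 2,
    "x * x": lambda t: t * t,
}

reference_value = x * x   # expected forward result
reference_grad = 2 * x    # analytic derivative of x^2

results = {}
for name, fn in variants.items():
    values_match = torch.allclose(fn(x), reference_value)
    grads_match = torch.allclose(grad_of(fn, x), reference_grad)
    results[name] = (values_match, grads_match)
    print(f"{name:18s} values match: {values_match}  grads match: {grads_match}")
```

For the efficiency part of the question, the same dictionary of variants could be timed with torch.utils.benchmark.Timer on your own hardware and input shapes, since any difference is likely to depend on both.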