Where is the implementation of `at::cuda::cumsum_out`?

I found a calling of at::cuda::cumsum_out in Multinomial.cu cuda source:

cumsum_out

But I can’t find it’s definition all over the torch repo.

OK, I got it: it’s definition is in RegisterCUDA.cpp, just a wrapper to at::native::cumsum.