Should I use cuda tensors for indexing if the data is on cuda?

Ho,

I guess in this case, it considers the Tensor as just a sequence of numbers.
Moving the indices to CUDA will definitely be faster.