Does my for loop run in parallel if all the tensors involved in the loop are on the GPU?

I also created a related topic: Link