How cuold I compute the pairwise cosine_similarity() with tensor

Hi, i want to compute the pairwise cosine_similarity of Tensor.shape == (N, 200), and i will get the similarity matrix with shape == (N,N) , Moreover, i want comput it with GPU. can any one could give me some advices? i will be very appreciate!