Pairwise cosine distance

learnpytorch · November 30, 2018, 1:12pm

I want to find cosine distance between each pair of 2 tensors.
That is given [a,b] and [p,q], I want a 2x2 matrix which finds
[ cosDist(a,p), cosDist(a,q)
cosDist(b,p), cosDist(b,q) ]
I want to be able to use this matrix for triplet loss with hard mining.
What is the best way to do this?

Thanks

InnovArul · November 30, 2018, 3:45pm

I am not sure about cosine distance. You might do something similar to below code.

For L2 loss, you might use this:

github.com

jiyanggao/Video-Person-ReID/blob/master/losses.py#L47-L89


class TripletLoss(nn.Module):
"""Triplet loss with hard positive/negative mining.


Reference:
Hermans et al. In Defense of the Triplet Loss for Person Re-Identification. arXiv:1703.07737.


Code imported from https://github.com/Cysu/open-reid/blob/master/reid/loss/triplet.py.


Args:
    margin (float): margin for triplet.
"""
def __init__(self, margin=0.3):
    super(TripletLoss, self).__init__()
    self.margin = margin
    self.ranking_loss = nn.MarginRankingLoss(margin=margin)


def forward(self, inputs, targets):
    """
    Args:
        inputs: feature matrix with shape (batch_size, feat_dim)

This file has been truncated. show original

learnpytorch · November 30, 2018, 4:32pm

Thanks @InnovArul, I had been referring to this code. I was wondering if it is possible to use nn.CosineSimilarity() instead of computing the cosine similarity manually, just to be sure that there are no errors by me.

John1231983 · October 18, 2019, 2:41am

@learnpytorch: Have you find the solution?

Deeply · October 18, 2019, 1:58pm

Something like:

import torch

def cosine_distance_torch(x1, x2=None, eps=1e-8):
    x2 = x1 if x2 is None else x2
    w1 = x1.norm(p=2, dim=1, keepdim=True)
    w2 = w1 if x2 is x1 else x2.norm(p=2, dim=1, keepdim=True)
    return 1 - torch.mm(x1, x2.t()) / (w1 * w2.t()).clamp(min=eps)


def cosine_similarity_n_space(m1=None, m2=None, dist_batch_size=100):
    NoneType = type(None)
    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    if type(m1) != torch.Tensor: # only numpy conversion supported
        m1 = torch.from_numpy(m1).float()
    if type(m2) != torch.Tensor and type(m2)!=NoneType:
        m2 = torch.from_numpy(m2).float() # m2 could be None
        
    m2 = m1 if m2 is None else m2
    assert m1.shape[1] == m2.shape[1]
    
    result = torch.zeros([1, m2.shape[0]])
    
    for row_i in range(0, int(m1.shape[0] / dist_batch_size) + 1):
        start = row_i * dist_batch_size
        end = min([(row_i + 1) * dist_batch_size, m1.shape[0]])
        if end <= start:
            break # cause I'm too lazy to elegantly handle edge cases
        rows = m1[start: end] 
        # sim = cosine_similarity(rows, m2) # rows is O(1) size        
        sim = cosine_distance_torch(rows.to(device), m2.to(device))
        
        result = torch.cat( (result, sim.cpu()), 0)
        
               
    result = result[1:, :] # deleting the first row, as it was used for setting the size only
    del sim
    return result.numpy() # return 1 - ret # should be used with sklearn cosine_similarity

John1231983 · October 18, 2019, 2:53pm

Thanks but it too complex and time consuming

Oli · October 18, 2019, 2:55pm

I worked on this during the summer. Cosine worked well for me

Have a look at this file

John1231983 · October 18, 2019, 3:32pm

@Oli: You just used original cosine sim function. It only returns the triangle value of matrix. I want to return a paired sim, so it will be full matrix

Deeply · October 18, 2019, 11:25pm

@John1231983; it isn’t time consuming if the GPU is enabled.

Usage example:

input1 = torch.randn(5, 7)
input2 = torch.randn(5, 7)

dist_mat  = cosine_distance_torch(input1, input2, 2.4)
print(dist_mat )

John1231983 · October 18, 2019, 11:36pm

Actually, I have input of 5D. so I sure that your way is time-comsuming

Deeply · October 18, 2019, 11:39pm

In fact, it does not work for a 5D input.

John1231983 · October 18, 2019, 11:40pm

@Deeply: I think torch.bmm is more simple and faster your way. But in my case. bmm is memory error

uniquefine · July 26, 2021, 3:33pm

You can use this snippet. Cosine similarity is the same as the scalar product of the normalized inputs and you can get the pw scalar product through matrix multiplication.
Cosine distance in turn is just 1-cosine_similarity.

def pw_cosine_distance(input_a, input_b):
   normalized_input_a = torch.nn.functional.normalize(input_a)  
   normalized_input_b = torch.nn.functional.normalize(input_b)
   res = torch.mm(normalized_input_a, normalized_input_b.T)
   res *= -1 # 1-res without copy
   res += 1
   return res

ebee · May 5, 2022, 8:34am

This github issue gives a nice way to achieve this using the built in cosine_similarity function.
Essentially:

th.nn.functional.cosine_similarity(x[:,:,None], x.t()[None,:,:])

111470 · May 24, 2023, 8:53am

I assume this solution is sample and clean:

since pairwise_cosine_similarity already achieved pairwise cosine distance compute, but do not support batch input. Cosine Similarity — PyTorch-Metrics 0.11.4 documentation (torchmetrics.readthedocs.io)
We can vmap this pairwise_cosine_similarity to make it aviliable for batch data.

# torch 1.13.0

import functorch
from torchmetrics.functional import pairwise_cosine_similarity
batched_pairwise_cosine_similarity= functorch.vmap(pairwise_cosine_similarity)

a=torch.randn(64,150,10)
b=torch.randn(64,100,10)

batched_pairwise_cosine_similarity(a,b)  # [64,150,100]