How to compute the distance between two word embeddings?

Hi, everyone. I am trying to train a model to output an approximate word embedding, and I want that output to be close to the original word embedding.

Which loss function should I use? Is MSE loss the right choice?
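For context, here is a minimal sketch of the setup I mean, using MSE loss (the batch size, embedding dimension, and tensor names are just placeholders):

```python
import torch
import torch.nn as nn

# Hypothetical setup: `target_emb` is the embedding from the pre-trained
# model, `approx_emb` stands in for my model's output (same shape).
target_emb = torch.randn(32, 300)                      # batch of 32, dim 300
approx_emb = torch.randn(32, 300, requires_grad=True)

loss = nn.MSELoss()(approx_emb, target_emb)
loss.backward()
```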

Thanks.

If I understand correctly, you have a pre-trained word embedding model, and you have your own model that you want to train so that it produces representations very close to the ones produced by the pre-trained model.
Is that it?


Check CosineSimilarity — PyTorch 1.7.0 documentation
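For example, a minimal sketch of comparing the two sets of embeddings that way (shapes are arbitrary):

```python
import torch
import torch.nn as nn

a = torch.randn(32, 300)  # your model's embeddings
b = torch.randn(32, 300)  # the pre-trained embeddings

cos = nn.CosineSimilarity(dim=1)
similarity = cos(a, b)       # shape (32,), values in [-1, 1]
distance = 1.0 - similarity  # one common way to turn similarity into a distance
```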


Expanding on @Versus’s answer, the corresponding loss function is nn.CosineEmbeddingLoss.
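A minimal sketch of how it is typically used; per the docs, a target of 1 marks pairs that should be similar (names and shapes are placeholders):

```python
import torch
import torch.nn as nn

out = torch.randn(32, 300, requires_grad=True)  # your model's embeddings
target_emb = torch.randn(32, 300)               # pre-trained embeddings

loss_fn = nn.CosineEmbeddingLoss(margin=0.0)
y = torch.ones(32)  # target 1 = "these pairs should be similar"
loss = loss_fn(out, target_emb, y)
loss.backward()
```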

nn.TripletMarginLoss may be helpful to you as well.
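A sketch of the triplet variant, assuming you can sample a negative (non-matching) embedding for each word; all names here are placeholders:

```python
import torch
import torch.nn as nn

anchor = torch.randn(32, 300, requires_grad=True)  # your model's output
positive = torch.randn(32, 300)  # the matching pre-trained embedding
negative = torch.randn(32, 300)  # the embedding of some other word

triplet_loss = nn.TripletMarginLoss(margin=1.0, p=2)
loss = triplet_loss(anchor, positive, negative)
loss.backward()
```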


That’s right :grinning:

Thank you :grinning:

Sorry, I don’t know how to set the target y of nn.CosineEmbeddingLoss in my case. Should it be -1?
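For reference, the PyTorch docs define the target as 1 for pairs that should be similar and -1 for pairs that should be dissimilar, so for pulling embeddings together the target would be 1:

```python
import torch

batch_size = 32
y_similar = torch.ones(batch_size)      # target 1: pull the pair together
y_dissimilar = -torch.ones(batch_size)  # target -1: push apart (up to the margin)
```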