Is cosine similarity the only way to measure similarity?

MahdiA · July 25, 2022, 5:23am

In most word embedding models like w2v or GloVe or … we use cosine similarity when we want to see the performance of the model. However, suppose that we have two vectors in the same direction but with different magnitudes, like [10,0] and [1,0]. According to cosine similarity they are similar, but by another measure such as Euclidean distance they are different.
What is the best way to measure the performance of an embedding model?
Thanks

luisoala · July 28, 2022, 12:33am

depends on what property you are interested in

a nice property of cosine similarity is its scale invariance, as the example you give demonstrates
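
To make the scale invariance concrete, here is a minimal sketch using the two vectors from the question. The helper names `cosine_similarity` and `euclidean_distance` are made up for illustration and are not taken from any particular library.

```python
import math

def cosine_similarity(a, b):
    # Dot product divided by the product of the two vector norms.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def euclidean_distance(a, b):
    # Straight-line distance between the two points.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

u = [10.0, 0.0]
v = [1.0, 0.0]

cos_uv = cosine_similarity(u, v)    # same direction, so similarity is 1
dist_uv = euclidean_distance(u, v)  # different lengths, so distance is 9

print(cos_uv, dist_uv)
```

Rescaling either vector by a positive factor leaves the cosine similarity unchanged, while the Euclidean distance changes with the scale.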

MahdiA · July 28, 2022, 5:49am

Yes, that is true. So how can I make sure that, in feature space, two words that are not similar to each other do not end up with vectors in the same direction but with different sizes, like [1,0] and [10,0]? They could be placed anywhere in the space.

mikeee · August 1, 2022, 9:37am

https://docs.scipy.org/doc/scipy/reference/generated/scipy.spatial.distance.cdist.htm Many of the more than 20 different distance measures defined in scipy can probably be converted to similarity measures easily. For number 6 (Y = cdist(XA, XB, 'cosine')), for example, 1 - cdist(XA, XB, 'cosine') should be the cosine similarity between the rows of XA and XB.
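
A rough sketch of that conversion, assuming numpy and scipy are installed. The arrays `XA` and `XB` are made-up sample data, and the `1 / (1 + d)` mapping for Euclidean distance is just one common convention, not something scipy defines.

```python
import numpy as np
from scipy.spatial.distance import cdist

# Hypothetical sample data: each row is one embedding vector.
XA = np.array([[10.0, 0.0], [0.0, 2.0]])
XB = np.array([[1.0, 0.0], [1.0, 1.0]])

# scipy's 'cosine' metric is a distance: 1 - cosine similarity.
cos_dist = cdist(XA, XB, "cosine")
cos_sim = 1.0 - cos_dist

# Euclidean distance has no built-in similarity counterpart;
# 1 / (1 + d) is one possible (assumed) way to map it into (0, 1].
euc_dist = cdist(XA, XB, "euclidean")
euc_sim = 1.0 / (1.0 + euc_dist)

print(cos_sim)
print(euc_sim)
```

Here `cos_sim[0, 0]` compares [10,0] with [1,0] and comes out as 1, while `euc_dist[0, 0]` for the same pair is 9, so the two measures disagree exactly as in the original question.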
