Computing similarity between sentences

olaffson · July 28, 2021, 9:41pm

Hello there,

I came across this very interesting post (Sentence Transformers in the Hugging Face Hub) that essentially shows a way to extract the embeddings for a given word or sentence

from sentence_transformers import SentenceTransformer
sentences = ["This is an example sentence", "Each sentence is converted"]

model = SentenceTransformer('sentence-transformers/paraphrase-MiniLM-L6-v2')
embeddings = model.encode(sentences)
print(embeddings)

Just a few questions if someone has a few moments:

how are these embeddings different from contextual embeddings that I would get with distilbert and other transformer models?
More importantly, once I have the embeddings I can simply compute a cosine similarity metrics with other sentences to cluster by similarity. If so, what is the need of an API as described here Sentence Transformers in the Hugging Face Hub. Am I missing something more subtle here?

Thanks!!

lewtun · July 29, 2021, 1:26am

hey @olaffson, as described in the sentencebert paper uses a siamese network structure to learn the sentence embeddings.

in general, this approach gives higher-quality embeddings than those you’d get from distilbert etc and you can find a nice performance chart here

regarding your second question, i’m not sure which api you’re referring to exactly in the blog post (which is mostly about the integration of sentence-transformers with the hugging face hub). but indeed, once you have the embeddings you can compute metrics / cluster using whatever tools you wish

olaffson · July 29, 2021, 8:55am

hi @lewtun thanks for this useful reference. I will look at it shortly. As for the API, I am referring to this part

import json
import requests

API_URL = "https://api-inference.huggingface.co/models/sentence-transformers/paraphrase-MiniLM-L6-v2"
headers = {"Authorization": "Bearer YOUR_TOKEN"}

def query(payload):
    response = requests.post(API_URL, headers=headers, json=payload)
    return response.json()

data = query(
    {
        "inputs": {
            "source_sentence": "That is a happy person",
            "sentences": [
                "That is a happy dog",
                "That is a very happy person",
                "Today is a sunny day"
            ]
        }
    }
)

If similarity is simply computed with a dot product (cosine similarity) why do we need to call the API? I think I might be missing something obvious here…

Thanks!

lewtun · July 29, 2021, 7:40pm

hi @olaffson, the inference api is useful if your model needs to fit inside some larger application and you don’t want to worry about all the infrastructure concerns around scaling / deployment etc.

having said that, if you’re just tinkering with embeddings or don’t need to deploy the model, then it’s probably simpler to just load the model on your machine and compute the embeddings directly

olaffson · July 31, 2021, 9:02pm

makes total sense! thanks @lewtun

Topic		Replies	Views
Can one get an embeddings from an inference API that computes Sentence Similarity? Beginners	9	5340	March 13, 2025
How to obtain similarity values from embeddings? Beginners	2	427	April 29, 2022
How to use embeddings to compute similarity? Beginners	4	4429	January 27, 2022
Can one get embeddings from an inference API that computes Sentence Similarity (in 2023)? Inference Endpoints on the Hub	0	418	June 3, 2023
Can Similarity Sentence Returns the Similarity Content? 🤗Transformers	0	324	April 27, 2023

Computing similarity between sentences

Related topics