Using Accelerated Inference API to produce sentence embeddings

osanseviero · May 18, 2021, 2:23pm

Your question comes at a good time. You can already do this by calling
https://api-inference.huggingface.co/pipeline/feature-extraction/MODEL_ID. This endpoint is experimental at the moment, so it may not be stable.
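
For illustration, here is a minimal sketch of how that call could look from Python using only the standard library. The helper names (`build_request`, `embed`), the `{"inputs": ...}` payload shape, and the bearer-token header are my assumptions about how the endpoint is used, not something spelled out above, and since the endpoint is experimental the response format may change.

```python
import json
import urllib.request

# Endpoint from the post; MODEL_ID is appended per request.
API_BASE = "https://api-inference.huggingface.co/pipeline/feature-extraction"


def build_request(model_id, sentences, token=None):
    """Build (but do not send) a POST request for the given sentences.

    Assumption: the endpoint accepts a JSON body of the form {"inputs": [...]}
    and an optional "Authorization: Bearer <token>" header.
    """
    headers = {"Content-Type": "application/json"}
    if token:
        headers["Authorization"] = f"Bearer {token}"
    body = json.dumps({"inputs": sentences}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/{model_id}", data=body, headers=headers, method="POST"
    )


def embed(model_id, sentences, token=None, timeout=30):
    """Send the request and return the decoded JSON response.

    The shape of the result (one vector per sentence, or one per token)
    depends on the model, so inspect it before relying on it.
    """
    request = build_request(model_id, sentences, token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical usage (replace MODEL_ID and the token with your own):
# vectors = embed("MODEL_ID", ["How are you?", "What's up?"], token="hf_...")
```

Keeping request construction separate from the network call makes it easy to check the URL and payload without hitting the API.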

Note that, as of now, we’re working on deeply integrating sentence-transformers with the Hub. This will be part of the v2 release of the library. Some details:

Allow downloading sentence-transformer models from the Hub (PR, merged).
Allow uploading sentence-transformer models to the Hub (PR).

We expect to have more exciting results very soon.
