Using Accelerated Inference API to produce sentense embeddings

Hi @vitali!

Your question comes in good time. You can already do this by calling
https://api-inference.huggingface.co/pipeline/feature-extraction/MODEL_ID. This endpoint is in experimental state at the moment, so things might not be stable.

Note that, as of now, we’re working on deeply integrating sentence-transformers with the Hub. This will be part of the v2 release of the library. Some details:

  • Allow downloading sentence-transformer models from the Hub (PR, merged).
  • Allow uploading sentence-transformer models from the Hub (PR).

We expect to have more exciting results very soon :slight_smile:

5 Likes