Is there an inference toolkit for rerankers?

tonifuc3m · July 12, 2024, 7:19am

I would like to deploy a reranker local. I am loading it as a crossencoder like in here how to use reranker model with langchain in retrievalQA case? · Issue #13076 · langchain-ai/langchain · GitHub
And then add it to a Langchain’s retriever chain.

I would like to have it deployed in local in a docker container and send queries to the endpoint, like I do with the Text Embedding Inference.

Topic		Replies	Views
Inference result not aligned with local version of same model and revision Inference Endpoints on the Hub	15	60	June 26, 2025
Reranking Algorithms Models	1	840	January 1, 2024
Can one get embeddings from an inference API that computes Sentence Similarity (in 2023)? Inference Endpoints on the Hub	0	418	June 3, 2023
Regarding a Trial Version Inference Endpoints on the Hub	0	208	April 23, 2024
Inference endpoint Intermediate	1	33	August 11, 2024