Inference result not aligned with local version of same model and revision

Thank you erikkaum, now I understand.
This feels like a serious bug: an inference service silently ignoring some layers of the model. At the very least, a prominent warning should be shown.
I am sorry, but for me this is a blocker for adopting your product. It is a nice idea, but not reliable enough for production. I will give it another try in 6 months. In the meantime I will go with Terraform and some autoscalable Docker containers. (Not so easy either: I have been working on it for the past couple of days, and autoscaling with cached model weights and enough CPU is not really what Docker was designed for.)
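For anyone hitting the same issue, one way to catch this kind of drift early is to run the same prompt through the hosted endpoint and through a local copy of the same model and revision, then compare the raw outputs. The snippet below is only a minimal sketch of the comparison step: `local_logits` and `remote_logits` stand in for the two outputs (however you obtain them), and the tolerance value is an assumption you would tune for your model and hardware.

```python
import numpy as np

def outputs_match(local_logits, remote_logits, atol=1e-3):
    """Return True when the two output arrays agree within tolerance.

    A large mismatch (as reported in this thread) suggests the hosted
    service is not running the exact weights/revision you expect.
    """
    local = np.asarray(local_logits, dtype=np.float64)
    remote = np.asarray(remote_logits, dtype=np.float64)
    if local.shape != remote.shape:
        return False
    return bool(np.allclose(local, remote, atol=atol))

# Illustrative values only, not real model outputs:
print(outputs_match([0.12, -1.5, 3.0], [0.12, -1.5, 3.0]))  # prints True
print(outputs_match([0.12, -1.5, 3.0], [0.90, -0.2, 1.1]))  # prints False
```

Small elementwise differences are normal across hardware and numeric backends, so an exact-equality check would be too strict; a whole layer being skipped produces divergence far beyond any reasonable tolerance.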
