Inference result not aligned with local version of same model and revision

Since this involves a paid service, contacting Expert Support is probably the fastest and most reliable option, especially if it looks like a bug.

BTW, on my local PC, both revisions produce identical embeddings, and CPU differs from CUDA only in the last decimal places:

from sentence_transformers import SentenceTransformer  # sentence-transformers==4.0.1
import torch
sentences = ["This is an example sentence", "Each sentence is converted"]
device = "cuda" if torch.cuda.is_available() else "cpu"
print(f"Running on {device}.") # Running on cuda.

model = SentenceTransformer("sentence-transformers/LaBSE").to(device)
embeddings = model.encode(sentences)
print("main:", embeddings)
#main: [[ 0.02882478 -0.00602382 -0.05947006 ... -0.03002249 -0.029607
#   0.00067482]
# [-0.05550233  0.02546483 -0.02157256 ...  0.02932105  0.01150041
#  -0.00848792]]

model = SentenceTransformer("sentence-transformers/LaBSE", revision="836121a0533e5664b21c7aacc5d22951f2b8b25b").to(device)
embeddings = model.encode(sentences)
print("836121a0533e5664b21c7aacc5d22951f2b8b25b:", embeddings)
#836121a0533e5664b21c7aacc5d22951f2b8b25b: [[ 0.02882478 -0.00602382 -0.05947006 ... -0.03002249 -0.029607
#   0.00067482]
# [-0.05550233  0.02546483 -0.02157256 ...  0.02932105  0.01150041
#  -0.00848792]]

model.to("cpu")
embeddings = model.encode(sentences)
print("On CPU:", embeddings)
#On CPU: [[ 0.02882476 -0.00602385 -0.05947007 ... -0.03002251 -0.02960699
#   0.00067482]
# [-0.05550234  0.02546484 -0.02157255 ...  0.02932107  0.01150037
#  -0.00848786]]
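Note that the CPU and CUDA outputs above differ only at roughly the 1e-7 level, which is expected float32 nondeterminism across backends rather than a model mismatch. A minimal sketch (NumPy only; the arrays are illustrative stand-ins truncated from the embeddings printed above) of how to check that two runs agree within tolerance:

```python
import numpy as np

# Illustrative stand-ins for two embedding runs (e.g. CUDA vs CPU);
# the values differ only at the ~1e-7 level, like the outputs above.
emb_gpu = np.array([[ 0.02882478, -0.00602382, -0.05947006],
                    [-0.05550233,  0.02546483, -0.02157256]], dtype=np.float32)
emb_cpu = np.array([[ 0.02882476, -0.00602385, -0.05947007],
                    [-0.05550234,  0.02546484, -0.02157255]], dtype=np.float32)

# Exact equality fails, but a float32-appropriate tolerance passes.
print(np.array_equal(emb_gpu, emb_cpu))          # False
print(np.allclose(emb_gpu, emb_cpu, atol=1e-5))  # True

# Per-sentence cosine similarity is effectively 1.0.
cos = (emb_gpu * emb_cpu).sum(axis=1) / (
    np.linalg.norm(emb_gpu, axis=1) * np.linalg.norm(emb_cpu, axis=1))
print(cos)
```

If a remote endpoint's embeddings fail this kind of tolerance check against a local run of the same revision, that points to a genuinely different model or preprocessing, not floating-point noise.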