How does one even evaluate a fine-tuned model? I don't want to evaluate it during training, both to keep things modular and because training already takes a while. I have been using a triplet dataset for embeddings that basically has a question, a positive example and a negative example.
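For reference, each record in my JSON file looks roughly like this (the field names match my code below; the texts here are made up just to illustrate the layout):

# A made-up sample record showing the triplet layout.
# Only the field names ("question", "positive_example",
# "negative_example") are the real ones from my dataset.
sample = {
    "question": "What is the capital of France?",
    "positive_example": "Paris is the capital and largest city of France.",
    "negative_example": "Berlin is the capital of Germany.",
}

print(sorted(sample.keys()))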
This is my code for evaluating so far:
from datasets import load_dataset
from sentence_transformers import SentenceTransformer, SimilarityFunction
from sentence_transformers.evaluation import TripletEvaluator

model = SentenceTransformer(model_path, device='cuda')
model.eval()

# Load the triplet dataset and keep a 1000-sample dev subset
ds = load_dataset('json', data_files=dataset_path, split='train')
ds = ds.select(range(1000))

dev_evaluator = TripletEvaluator(
    anchors=ds["question"],
    positives=ds["positive_example"],
    negatives=ds["negative_example"],
    batch_size=64,
    show_progress_bar=True,
    main_distance_function=SimilarityFunction.COSINE,
)

print("Beginning evaluation...")
evaluation_score = dev_evaluator(model)
print(evaluation_score["cosine_accuracy"])
However, there are a few problems with the code. First of all, it somehow runs over the data three times, and the second and third passes are quite a bit slower than the first.
Additionally, I don't know if it's just me, but I struggled to even find an example for main_distance_function, or even an example of a simple evaluation like this.
Any tips on what I'm doing wrong, or on whether I can optimize things further with my GPU?