I’m trying to fine-tune a SentenceTransformer using this tutorial: Train and Fine-Tune Sentence Transformers Models.
My data is similar to case #1:
sentence1 | sentence2 | label
sentence1 | sentence2 | label
…
sentence1 | sentence2 | label
I have four integer labels ranging from 0 to 3, so I’m using SoftmaxLoss; ContrastiveLoss seems to expect only binary labels, which doesn’t fit my data.
My question is: should the labels indicate similarity or distance? That is, should 0 mean the sentences are close, or that they are far apart?
And a second question: when I fit the model and provide an evaluator, the evaluation metrics are not printed during training:
train_loss = losses.SoftmaxLoss(
    model=model,
    num_labels=4,
    sentence_embedding_dimension=model.get_sentence_embedding_dimension(),
)
evaluator = evaluation.EmbeddingSimilarityEvaluator.from_input_examples(val_examples)
model.fit(
    train_objectives=[(train_dataloader, train_loss)],
    evaluator=evaluator,
    epochs=3,
)
Output:
As you can see, no metrics are printed even though I provided an evaluator. Any idea why?
Any help is appreciated.
Thanks