Fine-tuning a sentence transformer model with my own data

Hi all!

Cheers to this big community (and my first post here :mega:)

I am trying to fine-tune a sentence transformer model. My data contains the columns below:

  1. raw_text - the raw chunks of text
  2. label - corresponding label for the text - True or False.

I want to fine-tune a sentence transformer model so that the embeddings are optimized in such a way that all the True sentences end up closer to each other in the vector space than to the False sentences.

I have been reading about the losses in the Loss Overview section of the Sentence-Transformers documentation.

I am really confused about which loss to use for my type of data and use case. I am leaning towards the losses below, since they match my data format. However, as I read more about these losses and the way they are computed from anchor, positive, and negative samples, I feel less confident using them, since my data does not contain those kinds of pairs.

Can someone here help me understand whether what I am trying to do is possible with the existing losses in the sentence-transformers library?

Hi,

Refer to this blog post: Train and Fine-Tune Sentence Transformers Models. It has a section on the various cases your dataset may fall into, and which loss functions you can use for each.

I am trying to train the Sentence Transformer model and have also checked the link, but I always get an accuracy of only 20 to 30%.

  1. Is it maybe the base model? Which base model should I train?
  2. How to evaluate these models?