Fine-tuning a pretrained model - how many data samples are needed for effectiveness?

Hello. I've been running experiments comparing the performance of a Transformer from Hugging Face ("cardiffnlp/twitter-roberta-base-sentiment-latest") and OpenAI's APIs on a text classification/sentiment analysis task. Because of the OpenAI API cost, I've been running very small sample sets. The idea is to use the intersection of the two models' predictions to 'annotate' the text spans. Out of 100 samples each for the Positive and Negative classes, I have achieved F1 scores of roughly 74% and 68% respectively (note: the majority class, Neutral, always has a high intersection rate). My questions now:
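For clarity, here is roughly what I mean by intersection-based annotation, as a minimal sketch in plain Python (the texts, labels, and predictions below are made-up toy data, and `intersect_annotations` is just an illustrative helper, not real code from my pipeline):

```python
# Sketch of intersection-based annotation: keep only spans where both
# models agree, and treat the agreed label as the annotation.
from collections import Counter

def intersect_annotations(roberta_preds, openai_preds, texts):
    """Return (text, label) pairs for spans where both models agree."""
    return [
        (text, a)
        for text, a, b in zip(texts, roberta_preds, openai_preds)
        if a == b
    ]

texts = ["great phone", "battery died fast", "it arrived", "love it", "meh"]
roberta_preds = ["positive", "negative", "neutral", "positive", "negative"]
openai_preds  = ["positive", "negative", "neutral", "positive", "neutral"]

gold = intersect_annotations(roberta_preds, openai_preds, texts)
print(len(gold))  # 4 of the 5 toy spans agree
print(Counter(label for _, label in gold))
```

The disagreeing spans are simply dropped, which is part of why I am unsure whether the resulting gold set will be large enough.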

  1. If I want to fine-tune the model with my 'gold standard' datasets, will roughly 70 samples per class be enough for effective fine-tuning, or would I need a larger number of utterances in my gold-standard dataset?

  2. Could I use the same transformer that I used for the inference intersection study? I think the answer is yes, but I just want to make sure.
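On question 1, part of my worry is how small the held-out split gets at this scale. A quick back-of-envelope in plain Python (the 80/20 split ratio here is an assumed convention, not something I have committed to):

```python
# Rough split-size arithmetic for a ~70-samples-per-class gold set.
# The 80/20 train/validation ratio is an assumption for illustration.
samples_per_class = 70
train_frac = 0.8

train_per_class = int(samples_per_class * train_frac)  # 56 for training
val_per_class = samples_per_class - train_per_class    # 14 for validation

print(train_per_class, val_per_class)
# With only 14 validation examples per class, a single flipped prediction
# moves per-class accuracy by roughly 7 points, so metrics will be noisy.
```

That noisiness is why I suspect I may need more utterances (or cross-validation) even if 70 per class is technically enough to run fine-tuning.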

Any thoughts on and critiques of my approach are appreciated. I'm a new ML researcher, so I have much to learn.