Fine Tuning bart-large-mnli on only Entailments

vikram71198 · August 1, 2022, 4:29am

Hi everyone! This is my first question on the forum, so please excuse any mistakes with formatting.

So, I’m currently working on a project which requires me to fine tune bart-large-mnli on a custom dataset, and then use the fine tuned model for Zero Shot Classification.

My custom dataset has only entailments (125 per class for 5 classes = 625 entailments). I’m pretty sure I’ve written my fine tuning code perfectly, but I’m getting very poor results after fine tuning. As in, un fine tuned BART was more confident and more frequently accurate than my fine tuned BART. I’m very new to working with LLMs. I have a couple suspicions.

Is it a problem if I fine tune only on entailments? Is that the issue? Will I also have to include an equal number of contradictions to see a better performance after fine tuning?
Do, I need to expand my dataset? Is my custom dataset too small? I am working in a few shot setting after all.
What should the num_labels parameter be set to? Since, I’m fine tuning only on entailments in my current setting, should I have to change its value?

I’d really appreciate any insights here !

Topic		Replies	Views
Fine-tuning Zero-shot models Intermediate	4	6340	February 7, 2023
Bart-large-mnli zero-shot learning fine tuning problems Beginners	0	675	July 18, 2023
Fine tune model='facebook/bart-large-mnli' Intermediate	0	1270	May 16, 2022
Fine tune Zero-shot classification on multi-label dataset Models	4	3548	November 30, 2023
Pipeline with a fine-tuned pre-trained model Beginners	0	561	May 4, 2023

Fine Tuning bart-large-mnli on only Entailments

Related topics