Do you need to zero your gradients for BART?
(I haven’t used BART, but when training BERT I need to call `model.zero_grad()` before passing each batch of data to the model.)
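For reference, here’s a minimal sketch of the kind of loop I mean, assuming a PyTorch/transformers setup — the model checkpoint, `dataloader`, and hyperparameters are placeholders, not anything specific to your code:

```python
import torch
from transformers import BartForConditionalGeneration

# Placeholder setup -- swap in your own model, data, and hyperparameters.
model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)
model.train()

for batch in dataloader:  # assumed to yield dicts of tensors
    model.zero_grad()  # clear gradients left over from the previous batch
    outputs = model(
        input_ids=batch["input_ids"],
        attention_mask=batch["attention_mask"],
        labels=batch["labels"],
    )
    outputs.loss.backward()  # accumulate fresh gradients for this batch
    optimizer.step()         # update the weights
```

Without that `zero_grad()` call, PyTorch accumulates gradients across batches, which can silently break training. (`optimizer.zero_grad()` works equally well here.)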
Does your data look similar to the data BART was originally trained on? If it’s totally different, then your model could get worse before it gets better. What are you hoping it will learn from your new data?