How to finetune mT5

LaibaMehnaz · July 19, 2021, 10:20am

I am using mT5 for the task of summarization on a language other than English. But even after training for 30 epochs, the generations are very bad with rouge 1 as 31.5, whereas mBART gives a rouge 1 of 43.1 after training only for 11 epochs.
I wanted to know if mT5’s performance is expected to be like this compared to mBART, or am I doing something wrong.
Appreciate any help. Thank you

Topic		Replies	Views
Help with finetuning mBART on an unseen language Models	19	2060	October 30, 2020
Finetuned MT5 model generating the same first token for any input Intermediate	0	231	May 9, 2023
MBart Zero Shot Transfer Learning Beginners	0	350	June 4, 2021
mBART finetuning tips/post-mortem 🤗Transformers	6	2643	November 17, 2020
mBART fine tuning performs worse Beginners	0	27	November 22, 2024

How to finetune mT5

Related topics