Hi, I am currently trying to fine-tune T5 on XSum using a TPU. I trained t5-small, t5-base, and t5-large for 3 epochs each, and in every case the training loss stays nearly constant and the validation loss does not change at all.
t5-small example:

| Epoch | Training Loss | Validation Loss |
|-------|---------------|-----------------|
| 1     | 9.120000      | 12.475000       |
| 2     | 9.200000      | 12.475000       |
| 3     | 8.960000      | 12.475000       |
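For reference, this is roughly the kind of setup I am running (a simplified sketch using the standard `Seq2SeqTrainer`; the TPU launch via `xla_spawn` is omitted, and the hyperparameters, max lengths, and batch size shown here are illustrative, not necessarily the exact values from my runs):

```python
# Rough sketch of the fine-tuning setup (illustrative hyperparameters).
from datasets import load_dataset
from transformers import (
    AutoTokenizer,
    AutoModelForSeq2SeqLM,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainingArguments,
    Seq2SeqTrainer,
)

model_name = "t5-small"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

raw = load_dataset("xsum")

def preprocess(batch):
    # T5 uses a task prefix for summarization.
    inputs = tokenizer(
        ["summarize: " + doc for doc in batch["document"]],
        max_length=512,
        truncation=True,
    )
    labels = tokenizer(
        text_target=batch["summary"], max_length=64, truncation=True
    )
    inputs["labels"] = labels["input_ids"]
    return inputs

tokenized = raw.map(
    preprocess, batched=True, remove_columns=raw["train"].column_names
)

args = Seq2SeqTrainingArguments(
    output_dir="t5-small-xsum",
    num_train_epochs=3,
    per_device_train_batch_size=8,
    learning_rate=1e-4,
    evaluation_strategy="epoch",
    predict_with_generate=True,
)

trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    tokenizer=tokenizer,
)
trainer.train()
```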
I would like to know if I can get a rough estimate of how many steps or epochs are needed for each model to converge. The ROUGE scores I am getting are nowhere near the authors' reported numbers.
Thank you