T5 for conditional generation: getting started

prikmm · August 8, 2021, 7:33pm

Hey all, I have been trying to finetune T5 on XSum and I am getting constant validation loss. It doesn’t change at all. The training loss varies a but doesn’t converge like it stays in the range [10.0, 12.0]. I tried many methods like creating my own nn.Module which compatible with Trainer(), etc but none worked.
Link to colab (first version where I used default Trainer()).

Can anyone share a colab link or wandb project for my reference?

Thanks!

Topic		Replies	Views
Can t5 be used to text-generation? Beginners	7	8825	April 26, 2023
Finetune T5 with T5ForConditionalGeneration to multitask for Q&A and Summarization 🤗Transformers	0	641	November 28, 2023
How to generate text with T5Model other than T5ForConditionalGeneration? 🤗Transformers	0	300	September 22, 2022
T5forConditionalGeneration Beginners	2	2258	September 15, 2020
Problem generating with T5ForConditionalGeneration on a custom task 🤗Transformers	2	49	January 26, 2025

T5 for conditional generation: getting started

Related topics