T5 for conditional generation: getting started

valhalla · September 29, 2020, 9:30am

You can choose whatever format works well for you. The only thing to note is that your dataset or collator should return input_ids, attention_mask and labels.
To add new tokens:
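To make the collator requirement concrete, here is a minimal sketch in plain Python. It assumes each example is already tokenized into lists of ids; the function names and the token ids in the usage note are made up for illustration. It also assumes T5's pad token id of 0 and pads labels with -100, the value the loss ignores in Transformers.

```python
from typing import Dict, List

PAD_TOKEN_ID = 0        # T5's pad token id (assumed here, not read from a tokenizer)
LABEL_IGNORE_ID = -100  # label positions with this value are skipped by the loss


def pad(seq: List[int], length: int, value: int) -> List[int]:
    """Right-pad seq with value up to the given length."""
    return seq + [value] * (length - len(seq))


def collate(features: List[Dict[str, List[int]]]) -> Dict[str, List[List[int]]]:
    """Pad a list of tokenized examples into one batch.

    Each feature is expected to hold "input_ids" and "labels" as lists of ints.
    """
    max_src = max(len(f["input_ids"]) for f in features)
    max_tgt = max(len(f["labels"]) for f in features)

    batch = {"input_ids": [], "attention_mask": [], "labels": []}
    for f in features:
        n_real = len(f["input_ids"])
        batch["input_ids"].append(pad(f["input_ids"], max_src, PAD_TOKEN_ID))
        # 1 marks real tokens, 0 marks padding
        batch["attention_mask"].append(pad([1] * n_real, max_src, 0))
        # padded label positions must not contribute to the loss
        batch["labels"].append(pad(f["labels"], max_tgt, LABEL_IGNORE_ID))
    return batch
```

Calling collate([{"input_ids": [100, 200, 1], "labels": [300, 1]}, {"input_ids": [400, 1], "labels": [500, 600, 700, 1]}]) gives three equally shaped lists per key. Before passing them to the model you would wrap them in torch.tensor. Transformers also ships DataCollatorForSeq2Seq, which does this kind of padding for you.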

# add the new tokens to the tokenizer's vocabulary
tokenizer.add_tokens(list_of_new_tokens)

# resize the model's embeddings to match the new vocabulary size
model.resize_token_embeddings(len(tokenizer))

Using a task prefix is optional.
No, you won’t need to register the task. The original T5 repo requires that, but it’s not required here.
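Since the prefix is just text prepended to the input, a small helper can make it an explicit choice. This is only a sketch: build_example and its field names are hypothetical, and "summarize: " is used as a sample prefix.

```python
from typing import Dict, Optional


def build_example(source: str, target: str, prefix: Optional[str] = None) -> Dict[str, str]:
    """Return the raw text pair to tokenize, with an optional task prefix."""
    # the prefix is plain text; nothing has to be registered anywhere
    input_text = f"{prefix}{source}" if prefix else source
    return {"input_text": input_text, "target_text": target}


with_prefix = build_example("The cat sat on the mat.", "A cat sat.", prefix="summarize: ")
without_prefix = build_example("The cat sat on the mat.", "A cat sat.")
```

Whichever form you pick, use the same one at training and at inference time.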

You might find these two notebooks useful

Note: These notebooks manually add the EOS token (</s>), but that’s not needed with the current version; the tokenizer will handle it.

Here’s a great thread on tips and tricks for T5 fine-tuning
