Could you please guide me how loss function for T5 is computed? I mean it is a seq2seq model. Suppose it must map a sequence of X tokens to Y tokens, but it generates Z tokens. How Y and Z are compared to calculate loss?

What is loss function for T5

nielsr November 16, 2021, 3:06pm 6

Hi,

Yes, all parameters of the model can be slightly updated when fine-tuning the model. The parameters include the token embeddings, but also the weights of the self-attention layers, the language modeling head, etc.

Topic		Replies	Views
Question regarding T5ForConditionalGeneraton loss in the example Beginners	0	324	January 4, 2021
Using Trainer class with T5 - what is returned in EvalPrediction dict? 🤗Transformers	8	5323	February 14, 2022
Finetuning T5 on translation task 🤗Transformers	0	492	September 10, 2021
T5 fine tuning, loss difference when using labels and decoder_input_ids 🤗Transformers	2	1182	October 12, 2020
How to add a custom objective function based on the generated target sentence tokens from a T5 model during training? Models	0	239	March 17, 2023

What is loss function for T5

Related topics