How to set inputs when validating a T5 model

kwonmha · February 20, 2023, 2:14pm

I’m fine-tuning T5 on the SuperGLUE datasets.

For now, I use the first half of the ‘validation’ split as my validation set and the second half as my test set.
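The split described above can be sketched in plain Python as follows. This is only an illustration: `split_in_half` is a hypothetical helper name, and it assumes the ‘validation’ examples are available as an indexable sequence.

```python
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_in_half(examples: Sequence[T]) -> Tuple[List[T], List[T]]:
    """Return (first half, second half); the second half gets the extra item if the length is odd."""
    mid = len(examples) // 2
    return list(examples[:mid]), list(examples[mid:])


# Hypothetical usage: the first half is used for validation, the second half for testing.
validation_set, test_set = split_in_half(["ex0", "ex1", "ex2", "ex3", "ex4"])
```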

I’m a little confused about how to pass decoder_input_ids in the validation step.

For example,
input_ids and attention_mask can be obtained from tokenize("My model outperformed baseline models").
My understanding is that I should pass just the start token as decoder_input_ids, together with the variables above, to forward(). Is that correct?
Doing so produces an output of length 1 (the same length as the start token).
Or should I just pass the tokenized ‘labels’ without decoder_input_ids, as in the training step?

I’m also not sure whether I need to use generate() instead of forward().

joaogante · February 21, 2023, 3:44pm

Hey @kwonmha, have you seen our examples? This one, focused on GLUE, may give you the answers you need.
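As a rough sketch of the distinction the question is circling: in Transformers, when `labels` are passed to a T5 model without `decoder_input_ids`, the model builds the decoder inputs itself by shifting the labels one position to the right and prepending the decoder start token (for T5 this is the pad token, id 0). That gives one prediction per label position in a single forward() call, which is what a validation loss needs. Passing only the start token to forward() yields exactly one prediction, which matches the length-1 output described above; producing a full sequence requires calling the model repeatedly, which is what generate() does. The code below imitates both behaviours in plain Python. It does not call the library: `shift_right`, `greedy_decode`, `toy_next_token`, and the ids in `TARGET` are all made up for illustration.

```python
from typing import Callable, List

PAD_ID = 0      # T5 uses the pad token (id 0) as the decoder start token
EOS_ID = 1      # T5's end-of-sequence token id
IGNORE = -100   # label value that the loss ignores (used for padding in labels)


def shift_right(labels: List[int], start_id: int = PAD_ID) -> List[int]:
    """Teacher-forcing inputs: prepend the start token and drop the last label."""
    shifted = [start_id] + labels[:-1]
    # Ignored label positions cannot be fed to the decoder, so map them to pad.
    return [PAD_ID if token == IGNORE else token for token in shifted]


def greedy_decode(next_token: Callable[[List[int]], int],
                  max_new_tokens: int = 20) -> List[int]:
    """Toy stand-in for generate(): one model call per newly produced token."""
    decoder_ids = [PAD_ID]  # start from the decoder start token only
    for _ in range(max_new_tokens):
        token = next_token(decoder_ids)
        decoder_ids.append(token)
        if token == EOS_ID:
            break
    return decoder_ids[1:]  # drop the start token from the result


# Hypothetical "model": always continues a fixed, made-up target sequence.
TARGET = [31, 42, EOS_ID]


def toy_next_token(decoder_ids: List[int]) -> int:
    return TARGET[len(decoder_ids) - 1]


# Training-style inputs derived from the labels.
teacher_forcing_inputs = shift_right(TARGET)

# A single call with only the start token yields a single token.
single_step = toy_next_token([PAD_ID])

# Repeated calls yield the whole sequence, ending at EOS.
generated = greedy_decode(toy_next_token)
```

Under these assumptions, a reasonable validation setup is to call forward() with `labels` when the goal is a validation loss, and to call generate() and decode the result when the goal is a task metric computed on predicted text.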
