Input format for T5 model in Question Answering task

hduc-le · February 3, 2023, 4:41am

Hi, everyone. I am working with the T5 model for fine-tuning question-answering tasks with the custom dataset. My dataset contains 3 parts (question, question_context, answer_text). According to my understanding, we can encode the question + question_context into input_ids and feed it to the model, likewise, the answer_text will be labels for evaluating the model generation. I looked around the implementation of T5ForConditionalGeneration and realized that if the users input the labels and do not define the decoder_input_ids, they will assign it by the labels and shift it right for performing autoregressive internally. Here is the implementation for the code. However, in my case, I want to feed the model with the question as input_ids, question_context as decoder_input_ids, and answer_text as labels, so does the T5ForConditionalGeneration will shift right the decoder_input_ids for me internally or I have to do it by my self externally?

Hope you guys help me clear this situation. Thank you in advance!

Topic		Replies	Views
Finetune T5 with T5ForConditionalGeneration to multitask for Q&A and Summarization 🤗Transformers	0	638	November 28, 2023
Presenting A Pair of Inputs For A New T5 Model Beginners	0	219	October 19, 2022
T5Model predict <UNK> Beginners	0	223	October 5, 2022
T5 for conditional generation: getting started Beginners	20	18592	July 19, 2023
T5 Model Generate and Model Outputs Vastly Different Beginners	1	816	September 11, 2022

Input format for T5 model in Question Answering task

Related topics