Hi, everyone. I am working with the T5 model for fine-tuning question-answering tasks with the custom dataset. My dataset contains 3 parts (question, question_context, answer_text). According to my understanding, we can encode the question + question_context
into input_ids
and feed it to the model, likewise, the answer_text
will be labels
for evaluating the model generation. I looked around the implementation of T5ForConditionalGeneration and realized that if the users input the labels
and do not define the decoder_input_ids
, they will assign it by the labels
and shift it right for performing autoregressive internally. Here is the implementation for the code. However, in my case, I want to feed the model with the question as input_ids
, question_context as decoder_input_ids
, and answer_text as labels
, so does the T5ForConditionalGeneration will shift right the decoder_input_ids
for me internally or I have to do it by my self externally?
Hope you guys help me clear this situation. Thank you in advance!