How to run inference on conversational QA dataset (CoQA)

Seohyeong · December 16, 2022, 7:59am

Hi everyone,

I’m fine-tuning a T5 model on the CoQA dataset. The CoQA dataset is a conversational question-answering dataset where there is a sequence of question and answer pairs for a given story (context). I’ve trained a T5 model with input being a story, the question at a t time, and q&a pairs from t=1 to t-1, which is a pretty conventional way to train with a conversational QA dataset. Also I’ve used the same input setting during the inference.

However, I think the fair way is to not use the ground truth answers from t=1 to t-1 when generating an answer for the question at t, since it could be considered cheating. The fair way to evaluate would be sequentially running inference from t = 1 to t and using the answer that model generated at the inference time as history. Then the question I have is, how can I run this evaluation algorithm with a batch size larger than 1?

If there’s anyone who has thought about a similar problem, please do share!
Thank you.

Topic		Replies	Views
T5 trained with seq2seq method 🤗Transformers	0	296	June 26, 2023
What's the best way to speed up inference on a large dataset? Beginners	3	3933	March 13, 2022
Generating actual answers from QA models Beginners	2	1174	October 26, 2021
Question-Answering/Text-generation/Summarizing: Fine-tune on multiple answers Beginners	8	5316	November 20, 2021
Run parallel api inference for QA 🤗Transformers	0	314	November 19, 2021

How to run inference on conversational QA dataset (CoQA)

Related topics