How to do multi-span question answering?

sadhaklal · March 31, 2022, 4:52am

The BertForQuestionAnswering architecture is perfect for single span question answering, i.e., extracting a single span (as the answer) from (i) a context & (ii) a question. It outputs two tensors: start_logits & end_logits.

But what if the "train" split of the dataset provides multiple spans as the answer? (In this case, the objective is to predict multiple spans as the answer for a single example.)

What architecture works for this problem? Do we have to create something custom, or does HF Transformers provide a class (similar to BertForQuestionAnswering)?

nielsr · March 31, 2022, 8:02am

Hi,

A paper showed that you can treat multi-span question answering as token classification. They got really nice results with it: https://aclanthology.org/2020.emnlp-main.248.pdf

So basically you can treat a multi-span QA problem as a sequence labeling problem, hence you can use BertForTokenClassification.

Topic		Replies	Views
Question about BERT for qa Beginners	0	600	June 30, 2022
How can I put multiple questions in the same context at once using Question-Answering technique (i'm using BERT)? Beginners	2	1479	October 1, 2021
How to analyze ROCstories with `BertForQuestionAnswering`? 🤗Transformers	1	293	November 5, 2020
Which model of transformers to use if I want to do multiclassification of a pair of sentences containing a questionair 🤗Transformers	0	255	February 18, 2022
Converting input text sequences for relation extraction/classification Models	0	355	February 21, 2021

How to do multi-span question answering?

Related topics