We want to fine-tune a QA model based on BertForQuestionAnswering.
After training, we feed input_ids/token_type_ids/attention_mask to get span-start and span-end scores, then take the index with the maximum span-start score as the predicted start_index and the index with the maximum span-end score as the predicted end_index.
But sometimes the predicted start_index is greater than the predicted end_index, which is not a valid span.
Is there any reasonable method to handle this situation? Thanks~
span-start scores = [-0.1, -2.1, 0.7, 1.3, 4.1]
span-end scores = [-0.7, 3, 5, -0.7, 3.3]
predicted start_index = 4
predicted end_index = 2
This is not a reasonable (valid) answer span.
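One common fix (used in standard SQuAD-style decoding, and similar in spirit to what the HuggingFace question-answering pipeline does) is to not take the two argmaxes independently, but instead search over all pairs (start, end) with start <= end and pick the pair maximizing start_score[start] + end_score[end], optionally capping the answer length. Here is a minimal sketch; `best_valid_span` and `max_answer_len` are illustrative names, not library APIs:

```python
import numpy as np

def best_valid_span(start_scores, end_scores, max_answer_len=None):
    """Pick the (start, end) pair maximizing start_scores[start] + end_scores[end],
    restricted to valid spans with start <= end (and an optional length cap)."""
    start_scores = np.asarray(start_scores, dtype=float)
    end_scores = np.asarray(end_scores, dtype=float)
    n = len(start_scores)
    # candidates[i, j] = score of the span starting at token i and ending at token j
    candidates = start_scores[:, None] + end_scores[None, :]
    # Invalidate spans with start > end (strict lower triangle).
    invalid = np.tril(np.ones((n, n), dtype=bool), k=-1)
    if max_answer_len is not None:
        # Also invalidate spans longer than max_answer_len tokens.
        invalid |= np.triu(np.ones((n, n), dtype=bool), k=max_answer_len)
    candidates[invalid] = -np.inf
    start, end = divmod(int(np.argmax(candidates)), n)
    return start, end

print(best_valid_span([-0.1, -2.1, 0.7, 1.3, 4.1], [-0.7, 3, 5, -0.7, 3.3]))
# -> (4, 4): valid span score 4.1 + 3.3 = 7.4, instead of the invalid pair (4, 2)
```

On the example above, the independent argmaxes give the invalid pair (4, 2), while the joint search returns (4, 4), the best-scoring valid span. In practice you would also exclude positions outside the context (e.g. question tokens and padding) before the search.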