Logits in Question Answering model

NR1 · March 9, 2022, 1:31am

Hi,

I trained a question answering model on the SQuAD dataset. However, regardless of the model architecture I use (ELECTRA, BERT, RoBERTa, etc.), there are cases where the position predicted from the 'start logits' is greater than the position predicted from the 'end logits'.

Why is this the case? All the samples in the SQuAD dataset have an end position greater than the start position.

Thanks!

ahmeda335 · May 8, 2024, 5:38am

From the NLP course at Hugging Face
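
For context on the question: extractive QA heads of this kind produce two separate score vectors over token positions, and each is trained with its own cross-entropy loss, so nothing forces the highest-scoring start to come before the highest-scoring end. A common way to handle this at inference time is to score candidate (start, end) pairs and skip the invalid ones. Below is a minimal sketch of that idea using only NumPy. The function name `best_valid_span`, the parameters `n_best` and `max_answer_len`, and the toy logits are assumptions made for illustration; they are not taken from the thread or from any library API.

```python
import numpy as np


def best_valid_span(start_logits, end_logits, n_best=20, max_answer_len=30):
    """Return (start, end, score) for the best span with start <= end, or None."""
    start_logits = np.asarray(start_logits, dtype=float)
    end_logits = np.asarray(end_logits, dtype=float)

    # Indices of the n_best highest logits for each head, best first.
    start_candidates = np.argsort(start_logits)[::-1][:n_best]
    end_candidates = np.argsort(end_logits)[::-1][:n_best]

    best = None
    for s in start_candidates:
        for e in end_candidates:
            # Skip spans that end before they start or that are too long.
            if e < s or e - s + 1 > max_answer_len:
                continue
            score = start_logits[s] + end_logits[e]
            if best is None or score > best[2]:
                best = (int(s), int(e), float(score))
    return best


# Hypothetical toy logits: taken independently, the argmaxes give start=3, end=1.
start_logits = [0.1, 0.2, 0.3, 5.0, 0.0]
end_logits = [0.0, 4.0, 0.5, 3.0, 0.2]

naive_start = int(np.argmax(start_logits))
naive_end = int(np.argmax(end_logits))
span = best_valid_span(start_logits, end_logits)
print("independent argmax:", naive_start, naive_end)
print("best valid span:", span)
```

Summing the two logits is one simple way to rank pairs; other scoring rules are possible, and the limits on `n_best` and answer length are tuning choices rather than fixed values.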
