LongformerForQuestionAnswering - reaching TriviaQA leaderboard results

sapirw · December 9, 2020, 11:39am

Hi everyone,

I’m trying to reach the reported leaderboard results of Longformer (from the paper), and I am struggling.
Steps that I took:

I downloaded TriviaQA’s original dev set.
I’m using LongformerForQuestionAnswering for evaluation.
I normalize the predicted answers and compare them to the gold-label answers to compute ExactMatch.

Am I missing something? Should any further processing be done before evaluating with LongformerForQuestionAnswering?
I already looked at the Github repo of Longformer, it doens’t seem like they do any additional preprocessing to the dev data/context.

Jung · December 9, 2020, 12:41pm

Maybe @beltagy can help

sumeetsandhu · March 28, 2022, 10:13pm

I’ve been struggling for weeks to fine-tune this model with kaggle data on tensorflow…

Topic		Replies	Views
Https://huggingface.co/allenai/longformer-large-4096-finetuned-triviaqa Model cards	0	1139	March 28, 2022
Demo of Open Domain Long Form Question Answering Beginners	13	4520	February 8, 2021
Returning Multiple Answers for a QA Model on SageMaker Amazon SageMaker	4	1654	January 23, 2023
Trainer won't use GPU for evaluation :( 🤗Transformers	1	899	September 25, 2021
Fine-tuned longformer classifies all test samples as False Beginners	0	351	May 19, 2022

LongformerForQuestionAnswering - reaching TriviaQA leaderboard results

Related topics