Transformers BERT QA Task: run_qa.py vs run_qa_no_trainer.py

I want to use the transformers library to prune a BERT model on the downstream task SQuAD v1.1. There are two example scripts in transformers for this task: run_qa.py and run_qa_no_trainer.py.

Because I need to add some extra code to the training loop (for pruning), I chose the script that does not use the Trainer API.

However, I can't reproduce the SQuAD v1.1 results with this script. Can anyone share a set of hyperparameters for run_qa_no_trainer.py that will train a BERT model to an F1 score of about 88 on SQuAD v1.1?
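For context, here is the kind of run I have been trying. This is only a sketch based on the hyperparameters documented in the transformers question-answering examples README (max_seq_length 384, doc_stride 128, lr 3e-5, 2 epochs); the output directory is a placeholder, and it assumes `accelerate` is already configured for multi-GPU:

```shell
# Hedged example: README-style hyperparameters, not a verified recipe
# for this exact multi-GPU setup. With 4 GPUs the effective batch size
# is 4 x per_device_train_batch_size, so you may need to adjust it
# (and possibly the learning rate) to match single-GPU results.
accelerate launch run_qa_no_trainer.py \
  --model_name_or_path bert-base-uncased \
  --dataset_name squad \
  --max_seq_length 384 \
  --doc_stride 128 \
  --per_device_train_batch_size 12 \
  --learning_rate 3e-5 \
  --num_train_epochs 2 \
  --output_dir ./bert-squad-no-trainer  # placeholder path
```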

(I have 4 RTX 2080 Ti GPUs.)

Alternatively, does run_qa.py apply some optimization that run_qa_no_trainer.py does not? Should I switch to run_qa.py?
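If switching is the answer, the Trainer-based script is documented in the same examples README with a command along these lines, which reportedly reaches F1 around 88.5 with bert-base-uncased (again a sketch; the output directory is a placeholder):

```shell
# Hedged example of the Trainer-based script; with multiple GPUs
# visible, Trainer handles data parallelism itself.
python run_qa.py \
  --model_name_or_path bert-base-uncased \
  --dataset_name squad \
  --do_train \
  --do_eval \
  --max_seq_length 384 \
  --doc_stride 128 \
  --per_device_train_batch_size 12 \
  --learning_rate 3e-5 \
  --num_train_epochs 2 \
  --output_dir ./bert-squad  # placeholder path
```

But I would still prefer the no-trainer variant, since injecting pruning code into an explicit training loop is easier than subclassing Trainer.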

Thank you very much.