Higher GLUE scores on bert-base-uncased than reported in the BERT paper

Following the GLUE example in the huggingface/transformers GitHub repo (main branch), I ran fine-tuning for bert-base-uncased and got numbers similar to those in the repo's table. However, many task scores are much higher than the numbers reported in the BERT paper.
For example, the Matthews correlation for CoLA is 56.53 in the reported table, and 57.78 for my fine-tuned run, but only 52.1 in Table 1 of the BERT paper.
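For reference, the CoLA metric being compared here is the Matthews correlation coefficient (MCC). A minimal pure-Python sketch of it for binary labels (equivalent to `sklearn.metrics.matthews_corrcoef`, shown only to make the metric concrete; the actual evaluation script computes it for you):

```python
import math

def matthews_corrcoef(y_true, y_pred):
    """Matthews correlation coefficient for binary 0/1 labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: return 0.0 when any confusion-matrix margin is empty
    return (tp * tn - fp * fn) / denom if denom else 0.0

print(matthews_corrcoef([1, 1, 0, 0], [1, 1, 0, 0]))  # perfect agreement -> 1.0
print(matthews_corrcoef([1, 1, 0, 0], [1, 0, 1, 0]))  # chance level -> 0.0
```

MCC ranges from -1 to 1, so a few points of difference (52.1 vs. 56.53) is a noticeable gap, which is why the discrepancy with the paper stands out.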
Could anyone explain this? Am I missing something? Thanks!