Yes, you’re right that the problem is happening on the `trainer.evaluate()` step. It might be coming from the `label_names` argument in your `TrainingArguments`. From the docs we have:
> The list of keys in your dictionary of inputs that correspond to the labels. Will eventually default to `["labels"]` except if the model used is one of the `XxxForQuestionAnswering`, in which case it will default to `["start_positions", "end_positions"]`.
So it seems you need to provide a list like `['label']` instead of the string. If that doesn’t work, you could try renaming the “label” column in your CSV files to “labels” and then dropping the `label_names` argument from `TrainingArguments`.
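For example, something along these lines (just a sketch; I’m assuming your label column is literally called "label" and reusing the `output_dir` from your snippet, and `dataset` stands for the `datasets.Dataset` you built from the CSVs):

```python
from transformers import TrainingArguments

# Option 1: pass label_names as a list, not a string
training_args = TrainingArguments(
    output_dir="test_20210201_1200",
    label_names=["label"],
)

# Option 2: rename the column so the default ["labels"] applies and
# drop label_names entirely; `dataset` is assumed to be the
# datasets.Dataset you loaded from your CSV files
dataset = dataset.rename_column("label", "labels")
```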
You can then check if it works by just running `trainer.evaluate()`, which is faster than waiting for one epoch of training.
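For instance (assuming `trainer` is the `Trainer` instance you already built):

```python
# Runs only the evaluation loop, so it fails fast if the labels
# are not being picked up correctly
metrics = trainer.evaluate()
print(metrics)
```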
As a tip, I would also specify all the implicit arguments of your `TrainingArguments` and `Trainer` explicitly, e.g. use `output_dir="test_20210201_1200"` in `TrainingArguments`, and similarly for `model` and `args` in `Trainer`.
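Concretely, something like this sketch, where `model`, `training_args`, the tokenized datasets and `compute_metrics` are placeholders for the objects you already have in your script:

```python
from transformers import Trainer

# Everything passed by keyword rather than positionally
trainer = Trainer(
    model=model,                      # your model instance
    args=training_args,               # your TrainingArguments
    train_dataset=train_dataset,      # your tokenized training split
    eval_dataset=eval_dataset,        # your tokenized validation split
    compute_metrics=compute_metrics,  # your metrics function
)
```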
PS. One thing that looks a bit odd is the way you load the metric:

`metric = load_metric('f1', 'accuracy')`

I don’t think you can load multiple metrics this way, since the second argument refers to the “configuration” of the metric (e.g. GLUE has a config for each task). Nevertheless, this is probably not the source of the problem.
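If you want both metrics, one way (a sketch, assuming a standard single-label classification setup; for a multi-class problem you would also pass an `average` argument to the F1 metric) is to load them separately and combine them in your `compute_metrics` function:

```python
import numpy as np
from datasets import load_metric

# Load each metric on its own; the second positional argument of
# load_metric selects a configuration of a metric (e.g. a GLUE task),
# not an additional metric
f1_metric = load_metric("f1")
accuracy_metric = load_metric("accuracy")

def compute_metrics(eval_pred):
    logits, labels = eval_pred
    predictions = np.argmax(logits, axis=-1)
    return {
        "accuracy": accuracy_metric.compute(predictions=predictions, references=labels)["accuracy"],
        "f1": f1_metric.compute(predictions=predictions, references=labels)["f1"],
    }
```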