KeyError 'loss' while fine-tuning GPT-2 with the Trainer utility

from transformers import Trainer, TrainingArguments

training_args = TrainingArguments(
  output_dir='./results',          # output directory
  num_train_epochs=3,              # total # of training epochs
  per_device_train_batch_size=16,  # batch size per device during training
  per_device_eval_batch_size=16,   # batch size for evaluation
  logging_dir='./logs',            # directory for storing logs
)

trainer = Trainer(
  model=model,
  args=training_args,
  train_dataset=tokenized_datasets["train"],
  eval_dataset=tokenized_datasets["test"],
  tokenizer=tokenizer
)

Error Log:

/usr/local/lib/python3.6/dist-packages/transformers/trainer.py in train(self, model_path, trial)
745 tr_loss += self.training_step(model, inputs)
746 else:
-> 747 tr_loss += self.training_step(model, inputs)
748 self._total_flos += self.floating_point_ops(inputs)
749
/usr/local/lib/python3.6/dist-packages/transformers/trainer.py in training_step(self, model, inputs)
1073 loss = self.compute_loss(model, inputs)
1074 else:
-> 1075 loss = self.compute_loss(model, inputs)
1076
1077 if self.args.n_gpu > 1:
/usr/local/lib/python3.6/dist-packages/transformers/trainer.py in compute_loss(self, model, inputs)
1103 self._past = outputs[self.args.past_index]
1104 # We don't use .loss here since the model may return tuples instead of ModelOutput.
-> 1105 return outputs["loss"] if isinstance(outputs, dict) else outputs[0]
1106
1107 def is_local_process_zero(self) -> bool:
/usr/local/lib/python3.6/dist-packages/transformers/file_utils.py in __getitem__(self, k)
1356 if isinstance(k, str):
1357 inner_dict = {k: v for (k, v) in self.items()}
-> 1358 return inner_dict[k]
1359 else:
1360 return self.to_tuple()[k]
KeyError: 'loss'

If you have this error, it’s probably because you are not passing any labels to your model. It’s hard to know for sure since you don’t explain how you built your dataset.
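As a quick illustration (a minimal sketch; the "gpt2" checkpoint and the sample sentence are just placeholders), the model only returns a loss when a labels argument is passed:

from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

enc = tokenizer("Hello world", return_tensors="pt")

# No labels: the output has logits but loss is None, which is what makes
# Trainer's compute_loss fail with KeyError: 'loss'
print(model(**enc).loss)  # None

# With labels (for causal LM they are just the input ids; the model
# shifts them one position internally), a loss comes back
print(model(**enc, labels=enc["input_ids"]).loss)  # a scalar tensor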

Input data is a text file with one entry per line, in the following format:
<|startoftext|> sentence1… <|endoftext|>
<|startoftext|> sentence2… <|endoftext|>
<|startoftext|> sentence3… <|endoftext|>


This is the input to the Trainer:

DatasetDict({
    train: Dataset({
        features: ['attention_mask', 'input_ids'],
        num_rows: 25
    })
    test: Dataset({
        features: ['attention_mask', 'input_ids'],
        num_rows: 10
    })
})

So there are no labels, which is why it can't train.

Thanks! Could you point me to some references on adding labels?

The official examples include a fine-tuning script for causal language models like GPT-2, and there is also a notebook with an example.


It turned out I had skipped adding the function with this line: https://github.com/huggingface/transformers/blob/bf713cdec7c27416f514f231ba6728cfc8135120/examples/language-modeling/run_clm.py#L310. I skipped it because I was using a very small dataset. It works now! Thank you so much for your help!
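For anyone else landing here, the missing piece looks roughly like this (a sketch assuming the raw text lives in a "text" column and the entries are short enough to skip the grouping step of run_clm.py):

def tokenize_function(examples):
    result = tokenizer(examples["text"])
    # The line from run_clm.py that was missing: for causal LM, the
    # labels are just a copy of the input ids (the model shifts them
    # internally when computing the loss)
    result["labels"] = result["input_ids"].copy()
    return result

tokenized_datasets = raw_datasets.map(
    tokenize_function, batched=True, remove_columns=["text"]
)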


Hmm… would you have any idea why it still gives the error if the labels do exist (despite passing the label_names argument)?

Something like this:

{'attention_mask': [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
 'input_ids': [0, 18764, 9665, 38, 3572, 29228, 700, 5029, 102, 2],
 'src': 'Mizoram is ........',
 'tgt': 19}

I had a similar issue. After I changed the label key to exactly the word 'labels', it worked.
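In case it helps, with the datasets library that rename is a one-liner (the column name "tgt" is taken from the example above):

tokenized_datasets = tokenized_datasets.rename_column("tgt", "labels")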


But what is the reason that it only works with the word “labels” or “label”?
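As far as I understand (this is my reading of the Trainer source, so take it as a sketch): Trainer drops every dataset column whose name is not a parameter of the model's forward method (remove_unused_columns=True by default), and the model only computes a loss when it receives a keyword argument literally named labels. A column called "tgt" is therefore silently removed before it ever reaches the model. "label" also works because the default data collator renames it to "labels". Passing label_names only tells Trainer which columns to treat as labels during evaluation; it doesn't rename anything. You can check the expected name yourself:

import inspect
from transformers import GPT2LMHeadModel

model = GPT2LMHeadModel.from_pretrained("gpt2")
# "labels" is a parameter of forward(), so a column with exactly that
# name survives; anything else is dropped before the forward pass
print("labels" in inspect.signature(model.forward).parameters)  # True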