I am using the HuggingFace Seq2SeqTrainer with predict_with_generate=True.
During the validation step I get the value -100 in my predictions, which makes the tokenizer fail when decoding (and makes no sense to me). Why?
I can share the code, but in my experience a long piece of code turns readers away more easily than a short question.
Where can I read how, and with which tokens, the generated sequences are padded? I cannot find any information on this.
These are the predictions I get (from Seq2SeqTrainer with predict_with_generate=True) when I set max_new_tokens=100:
preds[78]: [ 0 3 2 3247 3321 3155 3 2 1018 6327 3155 3
2 1018 6327 3155 37 3 75 23 17436 19 3 9
3 729 302 13 3 75 23 17436 7 24 619 16
8 3 729 302 96 254 23 17436 121 1 -100 -100
-100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100
-100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100
-100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100
-100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100 -100
-100 -100 -100 -100]
The generated output is padded with label_pad_token_id=-100 instead of the tokenizer's padding token. Why?
When I use plain PyTorch code, the predictions are padded with the token 0.
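For what it's worth, the workaround used in the HuggingFace example scripts is to replace the -100 fill values with the tokenizer's pad token before decoding. A minimal sketch (assuming `preds` is the array returned by `Seq2SeqTrainer.predict(...).predictions` and the pad token id is 0, as for T5):

```python
import numpy as np

def replace_label_pad(preds, pad_token_id):
    """Swap the -100 fill values for the real pad token so decoding works."""
    preds = np.asarray(preds)
    return np.where(preds != -100, preds, pad_token_id)

# Example: the tail of the sequence above.
tail = [254, 23, 17436, 121, 1, -100, -100]
print(replace_label_pad(tail, 0).tolist())  # → [254, 23, 17436, 121, 1, 0, 0]
```

After this replacement, `tokenizer.batch_decode(preds, skip_special_tokens=True)` should no longer fail.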