Is there a way to return the "decoder_input_ids" from "tokenizer.prepare_seq2seq_batch"?

wgpubs · December 26, 2020, 7:42pm

Are the target tokens (the labels) replaced with the ignore token id somewhere as well? Doesn’t look like it from what I can see … so I’m assuming we need to do that ourselves, and pass the label ids with padding tokens set to -100.

Also, the decoder_input_ids come back in the form of <eos> <bos> X ..., but my understanding was always that it should start with <bos> and the labels shifted so that <bos> attempts to predict X[0] and so forth.

Topic		Replies	Views
What is the correct form of decoder_input_ids for LEDForConditionalGeneration? 🤗Transformers	1	712	July 5, 2021
What should be shifted for decoder input for Bart Beginners	1	329	July 8, 2021
Decoder attention mask in text2text/se2seq generation encoder-decoder models 🤗Transformers	1	1642	March 22, 2022
T5 fine tuning, loss difference when using labels and decoder_input_ids 🤗Transformers	2	1178	October 12, 2020
How does T5 create the correct decoder_input_ids? 🤗Transformers	2	2684	September 21, 2020

Is there a way to return the "decoder_input_ids" from "tokenizer.prepare_seq2seq_batch"?

Related topics