Is there a way to return the "decoder_input_ids" from "tokenizer.prepare_seq2seq_batch"?

valhalla · December 29, 2020, 6:47am

Yes, you need to manually replace the pad token ids in labels with -100, so that the padded positions are ignored when the loss is computed.
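A minimal sketch of that replacement, using plain Python lists in place of tensors so it runs without any dependencies. The helper name mask_pad_tokens and the pad_token_id value of 1 are assumptions made for illustration; in practice, take the id from your tokenizer.

```python
IGNORE_INDEX = -100  # positions with this label are skipped by PyTorch's cross-entropy loss by default


def mask_pad_tokens(labels, pad_token_id):
    """Return a copy of `labels` with every pad token id replaced by IGNORE_INDEX."""
    return [
        [IGNORE_INDEX if tok == pad_token_id else tok for tok in seq]
        for seq in labels
    ]


# Hypothetical padded batch of label ids; pad_token_id=1 is assumed here.
batch_labels = [[0, 8, 9, 2, 1, 1], [0, 5, 2, 1, 1, 1]]
masked = mask_pad_tokens(batch_labels, pad_token_id=1)
print(masked)
```

With tensors, the same idea is usually a single masked assignment on a copy of the label tensor.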

Ideally yes, it should start with the bos token. However, in the original fairseq implementation the models were trained with <eos> <bos> X ... as the decoder input, so we have kept it that way for reproducibility.
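To make the <eos> <bos> X ... layout concrete, here is a rough sketch of building decoder inputs by shifting the labels one position to the right and putting the eos token in front. This is not the library's own code: shift_right is a hypothetical helper, plain lists stand in for tensors, and the ids BOS=0, PAD=1, EOS=2 are assumed for illustration.

```python
def shift_right(labels, decoder_start_token_id, pad_token_id, ignore_index=-100):
    """Shift each label sequence one step right and prepend the start token.

    Any ignore_index left over from loss masking is turned back into the
    pad token id, since -100 is not a valid input id.
    """
    shifted = []
    for seq in labels:
        row = [decoder_start_token_id] + list(seq[:-1])
        row = [pad_token_id if tok == ignore_index else tok for tok in row]
        shifted.append(row)
    return shifted


# Assumed special token ids, for illustration only.
BOS, PAD, EOS = 0, 1, 2

# Labels of the form <bos> X <eos>, with padding already masked to -100.
labels = [[BOS, 8, 9, EOS, -100, -100]]

# Using eos as the start token gives <eos> <bos> X ... as described above.
decoder_input_ids = shift_right(labels, decoder_start_token_id=EOS, pad_token_id=PAD)
print(decoder_input_ids)
```

Passing BOS as decoder_start_token_id instead would give the "ideal" layout where the decoder input starts with bos.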
