Or do we even have to pass decoder_input_ids
anymore???
Looking at this example for MT5, it seems like the answer is “no”:
from transformers import MT5ForConditionalGeneration, T5Tokenizer

model = MT5ForConditionalGeneration.from_pretrained("google/mt5-small")
tokenizer = T5Tokenizer.from_pretrained("google/mt5-small")

article = "UN Offizier sagt, dass weiter verhandelt werden muss in Syrien."
summary = "Weiter Verhandlung in Syrien."

# src_texts become input_ids/attention_mask, tgt_texts become labels;
# no decoder_input_ids are produced here.
batch = tokenizer.prepare_seq2seq_batch(src_texts=[article], tgt_texts=[summary], return_tensors="pt")

# Only labels are passed; the model derives decoder_input_ids internally
# by shifting the labels one position to the right.
outputs = model(**batch)
loss = outputs.loss
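For comparison, here is a minimal sketch of what passing decoder_input_ids explicitly would look like, assuming the usual T5/MT5 convention of shifting the labels one position to the right and starting with decoder_start_token_id (which is what the model does internally when decoder_input_ids is omitted). Names like decoder_start, start_column, and outputs_explicit are just for illustration.

import torch

# Build decoder_input_ids by shifting labels right and prepending
# decoder_start_token_id (the pad token, id 0, for T5/MT5).
labels = batch["labels"]
decoder_start = model.config.decoder_start_token_id
start_column = torch.full((labels.shape[0], 1), decoder_start, dtype=labels.dtype)
decoder_input_ids = torch.cat([start_column, labels[:, :-1]], dim=-1)
# If the labels use -100 for padding (common in training loops), those
# positions must be replaced with the pad token before feeding the decoder.
decoder_input_ids = decoder_input_ids.masked_fill(decoder_input_ids == -100, model.config.pad_token_id)

outputs_explicit = model(
    input_ids=batch["input_ids"],
    attention_mask=batch["attention_mask"],
    decoder_input_ids=decoder_input_ids,
    labels=labels,
)
# outputs_explicit.loss should match outputs.loss from the call above.

As far as I can tell, both calls give the same loss, so passing only labels looks sufficient for teacher-forced training.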
It sure would make things easier if all we had to pass in were the labels, without having to build the decoder_input_ids ourselves when working with the ConditionalGeneration models. Please let me know either way.
Thanks