Hello everybody, I am trying to reproduce the generate function of the GenerationMixin class so that I can pass manual decoder input. I am using transformers v4.1.1. While I get nice results using the greedy_search function, I am not managing to reproduce beam_search, since my RAM overflows. I do not have memory problems when using generate. The code is below. I am not passing any special decoder input for now, only model.config.bos_token_id; my plan was to check that everything works and change it afterwards. I have tested this with both bart-base and pegasus-large, with the same results.
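For context, the plain generate call that works for me without memory issues looks roughly like this (a sketch using the same parameters as in the code below, with num_beams=4 to get beam search):

reference_outputs = model.generate(
    tokenizer.encode(text, return_tensors='pt'),
    min_length=min_length,
    max_length=max_length,
    no_repeat_ngram_size=no_repeat_ngram_size,
    num_beams=4,
    early_stopping=True,
    num_return_sequences=1)
print(tokenizer.decode(reference_outputs[0], skip_special_tokens=True))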
from transformers.generation_utils import GenerationMixin

gm = GenerationMixin

# generation parameters
min_length = config.BULLETS_MIN_LEN
max_length = config.BULLETS_MAX_LEN
num_beams = 4
early_stopping = True
no_repeat_ngram_size = 5
num_return_sequences = 1
model_kwargs = {}

# special token ids from the model config
pad_token_id = model.config.pad_token_id
eos_token_id = model.config.eos_token_id
decoder_start_token_id = model.config.decoder_start_token_id
bos_token_id = model.config.bos_token_id

# encode the text
input_ids = tokenizer.encode(text, return_tensors='pt')

# prepare attention mask, encoder outputs and decoder input ids
model_kwargs["attention_mask"] = gm._prepare_attention_mask_for_generation(
    model, input_ids, pad_token_id, eos_token_id)
if model.config.is_encoder_decoder:
    model_kwargs = gm._prepare_encoder_decoder_kwargs_for_generation(model, input_ids, model_kwargs)
    input_ids = gm._prepare_decoder_input_ids_for_generation(
        model, input_ids, decoder_start_token_id=decoder_start_token_id, bos_token_id=bos_token_id, **model_kwargs)
model_kwargs["use_cache"] = None

# logits processors (min_length, no_repeat_ngram_size, ...)
logits_processor = gm._get_logits_processor(
    model,
    repetition_penalty=None,
    no_repeat_ngram_size=no_repeat_ngram_size,
    bad_words_ids=None,
    min_length=min_length,
    eos_token_id=None,
    prefix_allowed_tokens_fn=None,
    num_beams=num_beams,
    num_beam_groups=None,
    diversity_penalty=None)
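(Eventually the idea is to replace the decoder input_ids built above with a manual prefix; a rough sketch of what I have in mind, where the prefix string is just a placeholder:)

import torch

decoder_prefix = "some manual prefix"  # placeholder text, not my real input
prefix_ids = tokenizer(decoder_prefix, add_special_tokens=False, return_tensors='pt').input_ids
# prepend the decoder start token, like _prepare_decoder_input_ids_for_generation does
start_ids = torch.full((prefix_ids.shape[0], 1), decoder_start_token_id, dtype=torch.long)
manual_decoder_input_ids = torch.cat([start_ids, prefix_ids], dim=-1)  # would replace input_ids above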
Using greedy_search:
outputs = gm.greedy_search(
    model,
    input_ids,
    logits_processor=logits_processor,
    max_length=max_length,
    pad_token_id=pad_token_id,
    eos_token_id=eos_token_id,
    **model_kwargs)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
And the result is identical to what generate produces with num_beams=1:
real_outputs = model.generate(
    tokenizer.encode(df_cc_group.iloc[0].text[0], return_tensors='pt'),
    min_length=min_length,
    max_length=max_length,
    no_repeat_ngram_size=no_repeat_ngram_size,
    num_beams=1)
print(tokenizer.decode(real_outputs[0], skip_special_tokens=True))
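(Concretely, the two decoded strings match:)

assert tokenizer.decode(outputs[0], skip_special_tokens=True) == \
    tokenizer.decode(real_outputs[0], skip_special_tokens=True)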
However, using beam_search my RAM overflows:
from transformers import BeamSearchScorer

batch_size = input_ids.shape[0]
length_penalty = model.config.length_penalty

beam_scorer = BeamSearchScorer(
    batch_size=batch_size,
    max_length=max_length,
    num_beams=num_beams,
    device=model.device,
    length_penalty=length_penalty,
    do_early_stopping=early_stopping,
    num_beam_hyps_to_keep=num_return_sequences)

# duplicate the inputs num_beams times, one copy per beam
input_ids, model_kwargs = gm._expand_inputs_for_generation(
    input_ids,
    expand_size=num_beams,
    is_encoder_decoder=model.config.is_encoder_decoder,
    **model_kwargs)

outputs = gm.beam_search(
    model,
    input_ids,
    beam_scorer,
    logits_processor=logits_processor,
    max_length=max_length,
    pad_token_id=pad_token_id,
    eos_token_id=eos_token_id,
    **model_kwargs)
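(If it ran to completion, I would decode the output the same way as in the greedy case:)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))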
Thank you in advance for the help!