Why is past_key_values not in GreedySearchDecoderOnlyOutput?

shunzh · September 30, 2022, 6:55pm

I am using model = GPT2LMHeadModel() for generation. In my use case, I need to call model.generate() multiple times, and the input_ids share a common prefix.

As I understand it, I could pass past_key_values as an argument to model.generate() so that it doesn’t recompute the keys and values of the shared prefix on every call. But how do I get past_key_values in the first place? The generate() function returns a GreedySearchDecoderOnlyOutput object (I set the beam size to 1, with no sampling), which does not contain past_key_values. Is it only available in the return value of model.forward()?

So I’m wondering what a typical example of reusing past_key_values across multiple calls to model.generate() would look like. Thanks!

shunzh · October 4, 2022, 10:55pm

Never mind. There’s a PR for it.
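
For anyone landing here before that lands: a minimal sketch of one possible workaround, which skips generate() and runs greedy decoding by hand on top of model.forward(), since forward() does return past_key_values. This is not what the PR does. greedy_continue is a hypothetical helper name, and the tiny randomly initialised config is only there so the snippet runs without downloading weights (swap in GPT2LMHeadModel.from_pretrained(...) for real use). It assumes batch size 1 with no padding. The deepcopy is there because some transformers versions return a cache object that is updated in place.

```python
import copy

import torch
from transformers import GPT2Config, GPT2LMHeadModel

torch.manual_seed(0)

# Assumption: a tiny random GPT-2 so the sketch runs offline.
config = GPT2Config(n_layer=2, n_head=2, n_embd=32, vocab_size=100, n_positions=64)
model = GPT2LMHeadModel(config).eval()


@torch.no_grad()
def greedy_continue(model, suffix_ids, prefix_cache, max_new_tokens):
    """Hypothetical helper: greedy decoding that starts from a cached prefix."""
    # Copy the cache so the shared prefix stays untouched for the next call.
    past = copy.deepcopy(prefix_cache)
    input_ids = suffix_ids  # must contain at least one token
    generated = []
    for _ in range(max_new_tokens):
        out = model(input_ids=input_ids, past_key_values=past, use_cache=True)
        past = out.past_key_values
        # Greedy choice: most likely token at the last position.
        next_token = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)
        generated.append(next_token)
        # From here on, only the new token is fed; the rest lives in the cache.
        input_ids = next_token
    return torch.cat(generated, dim=1)


# Encode the shared prefix once and keep its keys/values.
prefix_ids = torch.tensor([[5, 17, 42, 8, 23]])
with torch.no_grad():
    prefix_cache = model(input_ids=prefix_ids, use_cache=True).past_key_values

# Reuse the same prefix cache for several different suffixes.
suffixes = [torch.tensor([[3, 9]]), torch.tensor([[71]])]
continuations = [
    greedy_continue(model, suffix, prefix_cache, max_new_tokens=5)
    for suffix in suffixes
]
```

Each entry in continuations holds only the newly generated token ids; the prefix is encoded once, no matter how many suffixes are tried.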
