What does the decoder with past values means

echarlaix · August 5, 2022, 3:38pm

Since #241, we have enabled the possibility to only export one decoder : the latter will not have pre-computed key/values as inputs. This will results in the past_key_values to be computed at each generation step. To enable this export you only need to set use_cache to False when calling the from_pretrained method. To speed up decoding by leveraging the key/values hidden-states which have already been computed in the previous generation step, you need to export a second decoder with additional pre-computed key/values as inputs.

Topic		Replies	Views
The way to get Seq2SeqLM's `decoder_input_ids` to obtain `past_key_values` Beginners	0	1361	October 25, 2020
Default for the Decoder past_key_values - Marian Intermediate	0	415	January 5, 2023
Why past_key_values is not in GreedySearchDecoderOnlyOutput? 🤗Transformers	1	2057	October 4, 2022
Control EncoderDecoderModel to generate tokens step by step 🤗Transformers	8	2622	June 8, 2022
A question about the modeling_bart.py Models	1	325	November 12, 2020

What does the decoder with past values means

Related topics