Generate without using the generate method

yblainm · September 2, 2023, 2:22pm

That might be because the snippet doesn’t cache the attention key/value states while generating, if I understand correctly. You would need to keep past_key_values from each forward pass and feed it back in on the next step, and make sure use_cache is True in your model config.

Otherwise, in the above snippet you’re re-computing the entire past sequence every time you want the next token, even though causal attention means the states at past positions don’t change when new tokens are appended.
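
Here’s a rough sketch of the difference, not a drop-in replacement for generate(). To keep it runnable without downloading weights I’m assuming a tiny randomly initialised GPT-2 built from GPT2Config; the function names greedy_no_cache and greedy_with_cache, the config sizes, and the prompt ids are made up for illustration. It only does greedy decoding for a single unpadded sequence.

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel

torch.manual_seed(0)

# Assumption: a tiny random GPT-2 so the sketch runs offline.
# With a real checkpoint you would use from_pretrained(...) instead.
config = GPT2Config(vocab_size=100, n_positions=64, n_embd=32,
                    n_layer=2, n_head=2, use_cache=True)
model = GPT2LMHeadModel(config).eval()


@torch.no_grad()
def greedy_no_cache(model, input_ids, max_new_tokens):
    """Re-runs the whole sequence through the model at every step."""
    ids = input_ids
    for _ in range(max_new_tokens):
        logits = model(ids, use_cache=False).logits
        next_token = logits[:, -1, :].argmax(dim=-1, keepdim=True)
        ids = torch.cat([ids, next_token], dim=-1)
    return ids


@torch.no_grad()
def greedy_with_cache(model, input_ids, max_new_tokens):
    """Feeds the full prompt once, then only the newest token plus the cache."""
    ids = input_ids
    step_input, past = input_ids, None
    for _ in range(max_new_tokens):
        out = model(step_input, past_key_values=past, use_cache=True)
        past = out.past_key_values  # keys/values for all positions so far
        next_token = out.logits[:, -1, :].argmax(dim=-1, keepdim=True)
        ids = torch.cat([ids, next_token], dim=-1)
        step_input = next_token  # next step only needs the new token
    return ids


prompt = torch.tensor([[1, 2, 3, 4]])  # arbitrary token ids for the toy vocab
slow = greedy_no_cache(model, prompt, max_new_tokens=8)
fast = greedy_with_cache(model, prompt, max_new_tokens=8)
print(torch.equal(slow, fast))
```

Both loops should pick the same tokens; the cached one just avoids re-encoding the prefix on each step, which is where the speed difference comes from on longer sequences.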
