EncoderDecoderModel Generation with Specified EOS Token

Qikai · March 15, 2021, 7:25pm

Hi,

I have been working with the EncoderDecoderModel (with bert-base-chinese) for Seq2Seq generation. I have noticed that during output generation, if I were to explicitly define the EOS token, like below:

image|690x59

then following message “Setting pad_token_id to eos_token_id:102 for open-end generation.” will be printed. Furthermore, I have noticed that my overall generated sequence will be longer than if I were to ignore (not use) the “eos_token_id” argument.

I am wondering:

What is the message about? Specifically, what is open-end generation?
What might be some reasons that setting the “eos_token_id” will cause an increase in the generated sequence length? I would think that by explicitly denoting the EOS and terminating the generation process thereafter, the generated query should be shorter.

Thanks in advance.

Topic		Replies	Views
Encoder-Decoder model only generates bos_token's [<s><s><s>] Models	17	3149	December 6, 2022
Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation Beginners	5	46187	September 24, 2024
How does GPT decide to stop generating sentences without EOS token? 🤗Transformers	13	24380	August 19, 2024
BART - Input format Intermediate	4	1785	December 13, 2023
Generate without using the generate method Intermediate	8	6135	January 17, 2025

EncoderDecoderModel Generation with Specified EOS Token

Related topics