Can we force first token by model.config.forced_bos_token_id?

kangje · April 12, 2022, 4:43pm

I am using mbart-large-cc25 (MBartForConditionalGeneration) model to finetune it for multi-lingual tasks.

To evaluate the model, I need the model to generate translations for specific target language each time. How can we force the bos token for mbart?

I know that we can use model.generate with forced_bos_token_id but this is way too slow compared to just forward computing withmodel(**inputs). So I tried to set the forced_token_id by model.config.forced_bos_token_id=[lang_id] once I load a model… but it seems that the model starts generation with a random token regardless.

How can we make the model do the forward computing with forced bos token, without using model.generate?

Topic		Replies	Views
How to force bos_token_id for each example individually in MBart? 🤗Transformers	3	1183	February 16, 2024
Encoder-Decoder model only generates bos_token's [<s><s><s>] Models	17	3149	December 6, 2022
Force mBART to generate tokens in target language during backtranslation Models	0	490	March 22, 2021
Setting target language codes in mT5 🤗Transformers	0	145	December 15, 2023
BART - Input format Intermediate	4	1785	December 13, 2023

Can we force first token by model.config.forced_bos_token_id?

Related topics