Dynamic decoder token masking

MHoubre · February 13, 2023, 3:14pm

Hello,
I am currently trying to train a Sequence to Sequence model with constraints on the decoder and decoder only. For each training instance, I have a list of tokens that I do not want to see in the decoder output. This list is different for each input sequence.

I basically want to reduce the vocabulary of the decoder to a specific subset of the vocabulary and this subset is different for each input document.

I saw that in the Generation Config, you can pass a list of “suppress tokens” that gets the logits at -inf for the specified tokens. But this does not seem appropriate when the list varies a lot from one document to another.

Any idea on how to do this please? Is this even possible?

Topic		Replies	Views
Constrained decoding based on position 🤗Transformers	0	35	October 4, 2024
Sequence masking 🤗Transformers	0	379	April 25, 2022
Mask only specific words 🤗Tokenizers	4	3710	November 7, 2021
Decoder generate with prompts of variable lengths? 🤗Transformers	0	661	May 25, 2022
Decoder attention mask in text2text/se2seq generation encoder-decoder models 🤗Transformers	1	1637	March 22, 2022

Dynamic decoder token masking

Related topics