Example of prefix_allowed_tokens_fn() while text generation

vnik18 · June 7, 2021, 9:06pm

Hello, I would like to use the prefix_allowed_tokens_fn as an input to the model.generate() function, in order to perform constrained text generation with BART.

github.com

huggingface/transformers/blob/master/src/transformers/generation_utils.py#L670

    
      
              length_penalty: Optional[float] = None,
              no_repeat_ngram_size: Optional[int] = None,
              encoder_no_repeat_ngram_size: Optional[int] = None,
              num_return_sequences: Optional[int] = None,
              max_time: Optional[float] = None,
              max_new_tokens: Optional[int] = None,
              decoder_start_token_id: Optional[int] = None,
              use_cache: Optional[bool] = None,
              num_beam_groups: Optional[int] = None,
              diversity_penalty: Optional[float] = None,
              prefix_allowed_tokens_fn: Optional[Callable[[int, torch.Tensor], List[int]]] = None,
              output_attentions: Optional[bool] = None,
              output_hidden_states: Optional[bool] = None,
              output_scores: Optional[bool] = None,
              return_dict_in_generate: Optional[bool] = None,
              forced_bos_token_id: Optional[int] = None,
              forced_eos_token_id: Optional[int] = None,
              remove_invalid_values: Optional[bool] = None,
              synced_gpus: Optional[bool] = None,
              **model_kwargs,
          ) -> Union[GreedySearchOutput, SampleOutput, BeamSearchOutput, BeamSampleOutput, torch.LongTensor]:

I tried to adapt the function in the original repository here, but it doesn’t seem to be working. Can you please tell me if there are any examples of the kinds of functions that can be given as input to this parameter? Thank you!

keshavkolluru · July 18, 2022, 5:45am

Probably too late but just in case it helps someone else - the following code worked for me:

from transformers import AutoConfig, AutoModelForSeq2SeqLM, AutoTokenizer
from genre.trie import MarisaTrie

model = AutoModelForSeq2SeqLM.from_pretrained(‘t5-base’)
tokenizer = AutoTokenizer.from_pretrained(‘t5-base’)

trie = MarisaTrie([[0]+tokenizer.encode(‘Hello World’)])

output = model.generate(tokenizer.encode(‘Hello World’, return_tensors=‘pt’), prefix_allowed_tokens_fn=lambda batch_id, sent: trie.get(sent.tolist()))

The above snipped will always produce “Hello World” as the output. You can also include multiple strings when creating the Marisa trie.

The [0] is required at the start as the t5 model always produces 0 as the first token.

The definition of the trie is taken from here: https://github.com/facebookresearch/GENRE/blob/main/genre/trie.py and the file requires ‘pip install marisa-trie’ to be installed in the environment.

bryanzhou008 · July 21, 2022, 8:33pm

Thank you so much for this answer, it is exactly what I‘ve been searching for!

Topic		Replies	Views
Prohibit GPT-2 from generating some words on a condition 🤗Transformers	7	1113	April 25, 2021
T5 generate only words of the input Beginners	0	294	December 5, 2021
Generation but constraining first few tokens 🤗Transformers	0	735	December 25, 2022
Multi-decoder text generation with BART 🤗Transformers	0	625	June 7, 2021
Text generation using custom constraints 🤗Transformers	0	696	August 25, 2022

Example of prefix_allowed_tokens_fn() while text generation

Related topics