Stopping criteria BLOOM

rwheel · August 4, 2022, 7:06am

Hi, I’ve spent a couple of days reading forum topics about model stopping criteria, but I didn’t find a solution. Apologies in advance if this is a duplicate!

I’m using the BLOOM model and I want to stop text generation when a specific sequence of characters is generated, like ‘###’, but I can’t get it to work. I know I could post-process the generated text and extract the expected result, but it would be better to stop generation as soon as the criterion is met, so that no extra tokens are generated.
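For reference, the post-processing fallback mentioned above can be very small. The sketch below is plain Python and only illustrates the idea; the function name `truncate_at_stop` and its `stop_sequences` argument are made up for this example and are not part of any library.

```python
def truncate_at_stop(text: str, stop_sequences=("###",)) -> str:
    """Return `text` cut just before the earliest stop sequence, if any."""
    cut = len(text)
    for stop in stop_sequences:
        idx = text.find(stop)
        if idx != -1:
            # Keep the earliest match across all stop sequences
            cut = min(cut, idx)
    return text[:cut]


if __name__ == "__main__":
    generated = "Paris is the capital of France.###\nHuman: And Spain?"
    print(truncate_at_stop(generated))
```

This does not save any tokens, since the model still generates up to `max_new_tokens`; it only cleans up the returned text.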

I’m using the eos_token_id parameter for this, but it doesn’t work. I thought I was doing something wrong, but today I tried the InCoder model with the same eos_token_id parameter and it works! (For that test I changed both API_URL and the tokenizer in the code below to point to InCoder instead of BLOOM.)

Does anyone know whether this is a problem specific to BLOOM, or am I doing something wrong?

import requests
from transformers import AutoTokenizer

# API_URL, headers, context and nl_query are defined elsewhere.
tokenizer = AutoTokenizer.from_pretrained("bigscience/bloom")
end_sequence = "###"

payload = {
    "inputs": f"{context} \nHuman: {nl_query} \nAI: ",
    "parameters": {
        "top_p": 0.9,
        "temperature": 0.2,
        "max_new_tokens": 40,
        # Token id of the stop sequence, passed as the end-of-sequence id
        "eos_token_id": int(tokenizer.convert_tokens_to_ids(end_sequence)),
        "return_full_text": False,
    },
    "options": {
        "use_cache": True,
        "wait_for_model": True,
    },
}
response = requests.post(API_URL, headers=headers, json=payload)

Thanks!

fgatti675 · December 7, 2022, 7:17pm

Hi @rwheel
I am facing exactly the same problem!
Were you able to solve this?
Thanks

rwheel · December 9, 2022, 7:08am

Hi @fgatti675
Not yet… But I posted the same question in the BLOOM community (bigscience/bloom · Stopping criteria in text generation) and got the following answer:

Hey! The problem with BLOOM is it has never really seen an end-of-sequence token during text. Our finetuned BLOOMZ model will automatically stop when it deems it appropriate to do so (Normally after fulfilling the user requested task).
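Since base BLOOM is unlikely to emit an end-of-sequence token on its own, one option when the output can be consumed piece by piece (for example, when running generation locally and decoding tokens as they arrive) is to watch the decoded text for the stop string and stop reading as soon as it shows up. The sketch below is plain Python under that assumption; `stream_until` is a hypothetical helper, and `pieces` stands in for whatever yields decoded text chunks. It is not a transformers or Inference API call.

```python
from typing import Iterable, Iterator


def stream_until(pieces: Iterable[str], stop: str = "###") -> Iterator[str]:
    """Yield text from `pieces` until `stop` appears; the stop string is never yielded."""
    buffer = ""
    for piece in pieces:
        buffer += piece
        idx = buffer.find(stop)
        if idx != -1:
            # Stop string found: emit what precedes it and stop consuming pieces
            if idx > 0:
                yield buffer[:idx]
            return
        # Hold back a tail that could still be the start of the stop string
        safe = len(buffer) - (len(stop) - 1)
        if safe > 0:
            yield buffer[:safe]
            buffer = buffer[safe:]
    # Source ended without the stop string: flush what is left
    if buffer:
        yield buffer


if __name__ == "__main__":
    chunks = ["Hello ", "world #", "## Human: next question"]
    print("".join(stream_until(chunks)))
```

Because the generator returns as soon as the stop string is seen, a lazy source of pieces is not asked for more text after that point, which is where the token savings would come from. Matching on decoded text rather than on a single token id also covers the case where ‘###’ is split across several tokens.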
