Stopping `model.generate()` based on custom token

jodiak · February 1, 2021, 3:21am

Hello everyone, I’ve managed to train a huggingface model that generates coherent sequences based on my training data and am using generate to create these new sequences. This has worked well enough so far however I need to stop sequence generation based on the count of a particular token that denotes the start of a subsequence in my domain. Is there a way to leverage the generate() method to do this? ie rather than generate based on length generate until n number of a particular token are generated.

jodiak · February 22, 2021, 7:28am

I found that the best way to do this is by directly calling the model with the necessary inputs rather than using the generate method, and to build logic around this that checks the number of a particular token in the resulting sequence and stops once its reached.

StackSmasher · October 18, 2021, 2:03pm

Can you share your code snippet for doing this ? I want to implement a similar custom generate function but can’t parse through the entire codebase in a short time

Topic		Replies	Views
Generate function and stopping criteria - stop when generated entire word (continue if subtoken merely part of word) Beginners	0	2145	March 3, 2023
How to stop a step2step generation model while streaming Beginners	0	193	September 19, 2023
How to set stopping criteria in model.generate() when a certain word appears 🤗Transformers	3	3727	February 18, 2024
How to stop after generating "###" in transformers? Beginners	0	852	May 3, 2023
Implimentation of Stopping Criteria List Beginners	24	30421	January 24, 2025

Stopping `model.generate()` based on custom token

Related topics