How to insert a end-sequence

rudolfvonkrugstein · September 13, 2021, 12:35pm

I am new to HuggingFaces and I am trying to use the GPT-Neo model to generate the next sentence in a conversation (basically like a chatbot).

I tried around with GPT-3 before, and there I was using “Me:” as an End-Sequence to ensure the model would stop generating when it genrated the Text “Me:” (which indicates that it is my turn to say something).

Is there a similar option for GPT-Neo?

Ryulord · October 2, 2021, 8:15am

Just hopping in to say I have the exact same question in the hopes it’ll encourage someone to answer.

louis030195 · December 14, 2021, 10:19am

same here!

AtherionGG · March 22, 2022, 3:50am

There is the tokenizer.eos_token which is basically <|endoftext|>

I’m still a beginner too, I have tried to use it in various ways but nothing seems too fitting of getting good endings and results.

jdwx · March 22, 2022, 2:46pm

You have a couple of different options, but neither is perfect.

The easiest option is to generate the longest text you can stand. Your stopword (“Me:”) will probably get generated at least once in there, possibly several times. So do re.sub() to remove the first occurrance of your stopword and everything after it in the generated text.

That’s very easy to implement, the downside is it’s computationally expensive because you will usually wind up generating a ton of stuff only to throw it away.

If you are using model.generate(), you can also use the stopping_criteria parameter with a Callable class that checks to see if your stopword has been generated and returns True if it has to stop further generation. That gets tricky if you are generating multiple sequences (num_return_sequences>1). You’ll have to wait until they have all generated the stopword to end generation, meaning you’ll still have to trim after. And it’s rare (but not impossible, especially if temperature is low) for all sequences to produce the stopword at the same position.

There may also be a better way that I’m not aware of; I’m new at this.

Topic		Replies	Views
Stopping `model.generate()` based on custom token Intermediate	2	4394	October 18, 2021
How can I stop text generation naturally in an LLM running locally with Hugging Face, without using a hard MAX TOKEN limit? 🤗Transformers	1	383	November 8, 2024
Controlled Text Generation 🤗Transformers	2	2585	March 26, 2022
Ensure the sentence is complete during generation 🤗Transformers	5	7049	December 19, 2024
Stop generation while using past in GPT-2 Beginners	0	1088	November 15, 2020

How to insert a end-sequence

Related topics