Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation

When I use llama3-7b, it seems it can't stop inference until it reaches the maximum number of generated tokens. What should I do?
Is it related to this warning: "Setting pad_token_id to eos_token_id:128001 for open-end generation."?


I’m not familiar with it either, but I’ve heard that that warning and Llama 3’s odd stopping behavior are two separate issues. The first link explains the settings, and the second link explains what’s wrong with Llama 3. You can find more if you do a search.
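For reference, here is a minimal sketch of how those settings are usually passed with the Transformers library. The model id `meta-llama/Meta-Llama-3-8B-Instruct` is just an assumption for illustration; swap in whichever checkpoint you are actually using:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint for illustration; replace with your model.
model_id = "meta-llama/Meta-Llama-3-8B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Llama 3 uses two stop tokens: <|end_of_text|> (128001) and <|eot_id|> (128009).
# If generate() only stops on 128001, chat-style output can run on
# until it hits max_new_tokens.
terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]

inputs = tokenizer(
    "Explain what a pad token is in one sentence.", return_tensors="pt"
).to(model.device)

outputs = model.generate(
    **inputs,
    max_new_tokens=128,
    eos_token_id=terminators,
    # Passing pad_token_id explicitly also silences the
    # "Setting pad_token_id to eos_token_id" warning.
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```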

Thank you a lot, I understand it now. You’ve also shown me a way to find discussions about a specific model. I’m actually a beginner.

I’m also a newcomer to AI, only about six months in. I mostly just play around with image generation, so training language models isn’t my area of expertise.
I can at least guide you around HF, though. Is there anything you’re looking for?

If you’re looking for language models, the big ones are in the Inference Playground that HF is currently developing; that’s the first link.
Below are bookmarks of Spaces I’ve seen where you can actually try out language models.

Ahaa, I just thought the LLM’s responses were somewhat weird. I already found the reason in the link you shared.


I’m glad you got it resolved.
Well, the bottom line is that if you want conversational output, you should just use the Instruct variant. 😅
Some language models are simply broken and produce garbled output regardless of whether they’re Base or anything else, so you just have to keep trying them one after another. The leaderboards and other rankings help with that.
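If you do go the Instruct route, here is a rough sketch of the usual flow with `apply_chat_template`, again assuming the Transformers library and the `meta-llama/Meta-Llama-3-8B-Instruct` checkpoint:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # assumed checkpoint for illustration
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [
    {"role": "system", "content": "You are a concise assistant."},
    {"role": "user", "content": "Why does my model keep generating until max tokens?"},
]

# apply_chat_template inserts the special tokens (including <|eot_id|>)
# that the Instruct model was trained to stop on.
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(
    input_ids,
    max_new_tokens=256,
    eos_token_id=[tokenizer.eos_token_id, tokenizer.convert_tokens_to_ids("<|eot_id|>")],
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

With the Base model there is no chat template, so it just continues your text; the Instruct model was trained to emit `<|eot_id|>` at the end of each turn, which is why it stops cleanly when that token is in the terminator list.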