Setting `pad_token_id` to `eos_token_id`:50256 for open-end generation

The first bit of code runs, but I still get the eos_token_id message above. The second bit fails, so adding the token to the input_ids doesn’t work. :frowning: At least the model persists for the second query, so I’m making progress. :slight_smile:
Anyway, big thanks to Huggingface for posting these great models. You guys rock!

from transformers import GPTNeoForCausalLM, GPT2Tokenizer
from macos_speech import Synthesizer  # imported but not used in this snippet

model = GPTNeoForCausalLM.from_pretrained("EleutherAI/gpt-neo-1.3B")
tokenizer = GPT2Tokenizer.from_pretrained("EleutherAI/gpt-neo-1.3B")

prompt = (
    "In a shocking finding, "
)

input_ids = tokenizer(prompt, return_tensors="pt").input_ids

gen_tokens = model.generate(
    input_ids,
    do_sample=True,
    temperature=0.9,
    max_length=100,
)
gen_text = tokenizer.batch_decode(gen_tokens)[0]
print(gen_text)

prompt = (
    "Albert Einstein was "
)

input_ids_pre = tokenizer(prompt, return_tensors="pt").input_ids
input_ids = input_ids_pre + tokenizer.eos_token  # still fails: eos_token is a string, input_ids_pre is a tensor

gen_tokens = model.generate(
    input_ids,
    do_sample=True,
    temperature=0.9,
    max_length=100,
)
gen_text = tokenizer.batch_decode(gen_tokens)[0]
print(gen_text)
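
For reference, here’s roughly what I think appending the token to the tensor would have to look like (just a sketch, reusing model, tokenizer, and input_ids_pre from above; eos_token is a string, so it has to go in as a token id). Even then, it only changes the prompt and doesn’t make the pad_token_id message go away.

import torch

# Append the end-of-sequence token as an id, not as a string.
eos_id = torch.tensor([[tokenizer.eos_token_id]])  # shape (1, 1)
input_ids = torch.cat([input_ids_pre, eos_id], dim=-1)

gen_tokens = model.generate(
    input_ids,
    do_sample=True,
    temperature=0.9,
    max_length=100,
)
print(tokenizer.batch_decode(gen_tokens)[0])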

Oops, I had a typo. I misspelled input_ids. I fixed that, and it still failed. :frowning: Anyway, this token error seems to be one that a lot of people have encountered. I’ve read a lot of posts about it. Maybe a better question is, how can I find out about this sort of thing in the docs? Or would the tutorial answer this question?
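
From the posts I’ve read, the message itself goes away if you pass pad_token_id straight to generate() instead of letting it fall back to eos_token_id. A sketch, reusing the model, tokenizer, and input_ids from the first snippet:

gen_tokens = model.generate(
    input_ids,
    do_sample=True,
    temperature=0.9,
    max_length=100,
    pad_token_id=tokenizer.eos_token_id,  # 50256 for the GPT-2/GPT-Neo tokenizer
)
print(tokenizer.batch_decode(gen_tokens)[0])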

Here’s the pipeline code that suppresses the error:

from transformers import pipeline
import time

start = time.time()
print("Time elapsed on working...")

#generator = pipeline('text-generation', model='bigscience/bloom-560m')
#generator = pipeline('text-generation', model='gpt2')
generator = pipeline('text-generation', model='EleutherAI/gpt-neo-1.3B')
#generator = pipeline('text-generation', model='EleutherAI/gpt-j-6B')

# Passing pad_token_id explicitly suppresses the message.
text = generator("Albert Einstein was:", max_length=10, pad_token_id=50256, num_return_sequences=1)
print(text)
time.sleep(0.9)
end = time.time()
print("Time consumed in working: ", end - start)

# Same call without pad_token_id, for comparison.
text = generator("Albert Einstein was:", max_length=10, num_return_sequences=1)
print(text)
time.sleep(0.9)
end = time.time()
print("Time consumed in working: ", end - start)

I added pad_token_id = 50256 to my pipeline.
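
If you’d rather not hard-code 50256, the same value can be read off the pipeline’s own tokenizer. A sketch of that variant, using the same generator pipeline as above:

from transformers import pipeline

generator = pipeline('text-generation', model='EleutherAI/gpt-neo-1.3B')
text = generator(
    "Albert Einstein was:",
    max_length=10,
    pad_token_id=generator.tokenizer.eos_token_id,  # same id as 50256, without hard-coding it
    num_return_sequences=1,
)
print(text)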