FlaxGPTNeoForCausalLM generates the same text regardless of seed, temperature, top_k and top_p values

Hello,

I was trying to generate text using Flax (just as an experiment to see if it works well on a TPU VM). However, no matter what I tried, it always generated the exact same text for a given prompt. This happened both on a TPU VM and with local CPU inference.

Here is a short code snippet which demonstrates the problem I encountered:

from transformers import FlaxGPTNeoForCausalLM, AutoTokenizer

model_name = 'EleutherAI/gpt-neo-125M'
model = FlaxGPTNeoForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt_text = "Hello there, my name is"
generated_max_length = 50

#Changing the seed value does not seem to change the outcome
seed = 1001
model.seed = seed
model.config.pad_token_id = model.config.eos_token_id

inputs = tokenizer(prompt_text, return_tensors="jax")

#Changing temperature, top_k and top_p does not seem to change the outcome
outputs = model.generate(
    input_ids = inputs["input_ids"], 
    max_length=generated_max_length, 
    do_sample=True,
    temperature=0.8,
    early_stopping=True,
    top_k=50,
    top_p=0.90)

output_sequence = outputs['sequences'].squeeze(0)
text = tokenizer.decode(output_sequence, clean_up_tokenization_spaces=True)

print(text)

#Always prints:
#Hello there, my name isergus, and I was presented a competition looking for a library keeper with the 31-K--Goods Wallace Leisure library manager on 10-11-08 447-5721. It involves teaching a Royally

I also tried calling jax.random.PRNGKey(seed), which didn't help, as well as other approaches such as:

model.top_p = 0.9
model.top_k = 50

jit_generate = jax.jit(model.generate)
#jit_generate( inputs["input_ids"],  ....
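For completeness, here is roughly the check I used to see whether the PRNG key has any effect at all (reusing the model, tokenizer and inputs from the snippet above; I'm assuming here that generate accepts a prng_key argument, which may depend on the transformers version):

import jax

def sample_once(key_seed):
    #Generate with an explicit PRNG key and return the decoded text
    out = model.generate(
        input_ids=inputs["input_ids"],
        max_length=generated_max_length,
        do_sample=True,
        prng_key=jax.random.PRNGKey(key_seed),
        pad_token_id=model.config.eos_token_id)
    return tokenizer.decode(out['sequences'].squeeze(0))

#If sampling were wired to the key, different keys should usually give different texts
print(sample_once(0) == sample_once(1))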

I assume I’m doing something very wrong, but I was not able to find any example code for generating text with FlaxGPTNeoForCausalLM (I did find examples for training it).

I hope I posted this in the right forum.
Regards,
Doron


Update: I tried again on the latest master branch (transformers 4.11.0.dev0), and this time I was able to get a different output by changing the seed. However, changing temperature, top_k and top_p still does not seem to influence the outcome (a small check for this is included after the snippet below).

import jax
from transformers import FlaxGPTNeoForCausalLM, AutoTokenizer

model_name = 'EleutherAI/gpt-neo-125M'
model = FlaxGPTNeoForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt_text = "Hello there, my name is"
generated_max_length = 50

#Changing the seed (and thus the prng_key value below) does seem to change the outcome.
seed = 1000
model.seed = seed

inputs = tokenizer(prompt_text, return_tensors="np")

#Changing temperature, top_k and top_p does not seem to change the outcome
outputs = model.generate(
    input_ids = inputs["input_ids"], 
    max_length=generated_max_length, 
    pad_token_id = model.config.eos_token_id,    
    prng_key=jax.random.PRNGKey(seed),
    temperature=1.0,
    early_stopping=True,
    top_k=50,
    top_p=0.95,
    do_sample=True,
    no_repeat_ngram_size=4)

output_sequence = outputs['sequences'].squeeze(0)
text = tokenizer.decode(output_sequence, clean_up_tokenization_spaces=True)

print(text)
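
To double-check that the sampling parameters really have no effect, I also generate with a fixed prng_key while sweeping top_k (the same can be done for temperature and top_p); if the top-k warper were being applied, at least some of these outputs should usually differ:

#Same model, tokenizer and inputs as in the snippet above; fixed PRNG key, varying top_k
fixed_key = jax.random.PRNGKey(0)
for k in [1, 10, 50]:
    out = model.generate(
        input_ids=inputs["input_ids"],
        max_length=generated_max_length,
        do_sample=True,
        prng_key=fixed_key,
        top_k=k,
        pad_token_id=model.config.eos_token_id)
    print(k, tokenizer.decode(out['sequences'].squeeze(0)))

With top_k=1, sampling should collapse to the single most likely token at every step, so that run in particular should look different from top_k=50 if the parameter is respected.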