Using text-generation pipeline for Llama-2-7b-chat-hf setting high T doesn't change output

kechan · August 1, 2023, 1:24pm

I am trying out “meta-llama/Llama-2-7b-chat-hf”

model_name = “meta-llama/Llama-2-7b-chat-hf”
pipeline = transformers.pipeline(“text-generation”,
model=model_name,
torch_dtype=torch.float16,
device_map=“auto”)
sequences = pipeline(prompt,
num_return_sequences=1,
temperature=5.0, top_p=1.0, top_k=0,
eos_token_id=tokenizer.eos_token_id,
max_length=1000)

I set T=5 that’s pretty high to ensure I get variable results. But curiously, I am getting the exact same output from the llama. I do believe my top_p and top_k are correctly set but could be wrong. anyone observed the same?

svetG · December 20, 2023, 11:42am

use this table pls to setup parameter values for Llama:

Temperature of 5 is out of reach (max=1, default=0.5), top_p=1 means that you use all of 100% generated options (default=0.9), and top_k is not something you usually tweak (usually is 40 by default for most of the LLMs)

Topic		Replies	Views
Pipeline Llama3 Text Generation Saving a Memory/Cache Beginners	9	2278	January 5, 2025
meta-llama/Llama-2-7b-chat-hf weird responses, compared to the ones returned by the HF API 🤗Transformers	1	115	February 2, 2025
Llama-2 generation_config top_p=0.6 Models	3	10476	August 8, 2023
LLama 70B not working Beginners	1	1348	August 8, 2023
Help with Llama 2 Finetuning Setup Beginners	16	16006	May 20, 2024

Using text-generation pipeline for Llama-2-7b-chat-hf setting high T doesn't change output

Related topics