Llama-2 generation_config top_p=0.6

Thanks for the response! I guess you probably won't know the answer to this, but is it possible that the temperature and top_p were somehow swapped by accident? In the official Llama repo, a temperature of 0.6 and a top_p of 0.9 are used (https://github.com/facebookresearch/llama/blob/main/example_text_completion.py, https://github.com/facebookresearch/llama/blob/main/example_chat_completion.py), whereas generation_config.json has them swapped: a temperature of 0.9 and a top_p of 0.6.
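
In the meantime, a minimal sketch of how one could override the shipped generation_config at call time with transformers, passing the values from the official example scripts explicitly (the model id below is just one Llama-2 checkpoint as an example; any variant applies):

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "meta-llama/Llama-2-7b-chat-hf"  # example checkpoint; requires gated access

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("The capital of France is", return_tensors="pt")

# Explicit kwargs to generate() take precedence over whatever
# generation_config.json ships with the checkpoint.
outputs = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.6,  # value from the official example scripts
    top_p=0.9,        # value from the official example scripts
    max_new_tokens=64,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```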