I am using the `AsyncClient` from Text Generation Inference (TGI) to query a number of models, including Llama 3, Mixtral, Llama 2, Vicuna, and Command-R, via its `generate()` function. I see that unless I pass them explicitly, `top_p` and `temperature` are set to `None`.

However, I assume there must be some default `temperature` and `top_p` in effect. Where do I find this information? Is it model-specific, or set by TGI?
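For reference, here is a minimal sketch of the setup I am describing. It assumes the `text_generation` client package is installed and a TGI endpoint is running at `http://127.0.0.1:8080` (the URL, prompt, and parameter values are placeholders); passing `temperature` and `top_p` explicitly sidesteps whatever server-side defaults apply when they are left as `None`:

```python
import asyncio

try:
    from text_generation import AsyncClient
except ImportError:
    # Client library not installed; the sketch still illustrates the call.
    AsyncClient = None


async def query_model(prompt: str, url: str = "http://127.0.0.1:8080") -> str:
    client = AsyncClient(url)
    # Passing temperature and top_p explicitly, rather than leaving them
    # as None and relying on whatever default the server applies.
    response = await client.generate(
        prompt,
        max_new_tokens=64,
        temperature=0.7,
        top_p=0.9,
    )
    return response.generated_text


if __name__ == "__main__":
    print(asyncio.run(query_model("What is deep learning?")))
```

When these parameters are omitted, my understanding is that the behavior I observe (both reported as `None`) comes from the client simply not sending them, so whatever happens next is decided server-side.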