Using nucleus sampling and temperature at the same time

smfsamir · June 27, 2023, 5:39am

Hi,

I have a fine-tuned FlanT5 model and I’m using it for inference with the model.generate method. I’m inspecting the behaviour of decoding methods that alter the next-token probability distribution, specifically the top_p parameter (for nucleus sampling) and the temperature parameter. What happens if I specify both top_p and temperature? Will it first flatten the distribution with a high temperature and then take the nucleus of this flattened distribution (i.e., temperature, then nucleus)? Or will it take the nucleus first and then use the temperature to flatten the distribution within it (i.e., nucleus, then temperature)? Or does it do something else (e.g., use only the nucleus and ignore the temperature, or vice versa)?
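To make the two orderings concrete, here is a minimal sketch of what I mean, in plain Python on a toy four-token distribution. This is not the transformers implementation, and it does not show which order model.generate uses. The function names (softmax, nucleus_indices, temperature_then_nucleus, nucleus_then_temperature) and the logits are hypothetical, made up for illustration only.

```python
import math

def softmax(logits, temperature=1.0):
    """Turn logits into probabilities after dividing by the temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def nucleus_indices(probs, top_p):
    """Smallest set of most-probable token indices whose mass reaches top_p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    return sorted(kept)

def temperature_then_nucleus(logits, temperature, top_p):
    """Ordering 1: flatten with temperature, then cut the nucleus."""
    probs = softmax(logits, temperature)
    kept = nucleus_indices(probs, top_p)
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}  # renormalise over the nucleus

def nucleus_then_temperature(logits, temperature, top_p):
    """Ordering 2: cut the nucleus at temperature 1, then flatten inside it."""
    kept = nucleus_indices(softmax(logits), top_p)
    probs = softmax([logits[i] for i in kept], temperature)
    return dict(zip(kept, probs))

# Hypothetical toy logits for a four-token vocabulary.
logits = [4.0, 2.0, 1.0, 0.0]
a = temperature_then_nucleus(logits, temperature=2.0, top_p=0.9)
b = nucleus_then_temperature(logits, temperature=2.0, top_p=0.9)
print(sorted(a), sorted(b))  # [0, 1, 2] [0, 1]
```

With these toy numbers the two orderings give different candidate sets: applying the temperature first spreads the mass enough that a third token enters the nucleus, while cutting the nucleus first keeps only two tokens and the temperature merely reweights them. So the order would matter in practice, which is why I’m asking.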

Thank you!
