I see the word “temperature” being used at various places like:
float, optional, defaults to 1.0) – The value used to module the next token probabilities.
- temperature scaling for calibration
- temperature of distillation
can anyone please explain what does it mean, or point me to a source with explanation?