Using perplexity as a metric during training

I have followed the AutoModelForSequenceClassification tutorial for fine-tuning with accuracy and other metrics, and everything went fine.

Now I am following the AutoModelForMaskedLM tutorial for domain adaptation, and that is also working well.

However, I am struggling to use a metric in the same way as before, so that perplexity is reported after each epoch.

Of course, the following works, but perplexity is only reported before and after training, not after each epoch:

import math

eval_results = trainer.evaluate()
print(f">>> Perplexity: {math.exp(eval_results['eval_loss']):.2f}")

However, the following training argument does not achieve that:

metric_for_best_model='perplexity',
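
For context, my TrainingArguments look roughly like this (a sketch; output_dir is a placeholder and I have omitted the rest of my arguments):

from transformers import TrainingArguments

# Sketch of my setup; output_dir is a placeholder.
training_args = TrainingArguments(
    output_dir="mlm-domain-adaptation",
    evaluation_strategy="epoch",  # evaluate at the end of every epoch
    logging_strategy="epoch",     # log metrics at the end of every epoch
    # metric_for_best_model='perplexity',  # tried this, but it has no effect on its own
)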

I have also tried passing the following argument to the Trainer:

compute_metrics=compute_metrics,

The question is, how do I define the function?

def compute_metrics(eval_results):
    return math.exp(eval_results['eval_loss'])

That does not work either.
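
My best guess is that compute_metrics has to return a dict keyed by the metric name, and that it receives an EvalPrediction (logits and label ids) rather than the eval_loss, so the loss would have to be recomputed inside it. Something like the sketch below is what I have in mind, but I am not sure it is right:

import math

import torch
from transformers import EvalPrediction

# Sketch: recompute the masked-LM cross-entropy from the logits and labels,
# exponentiate it, and return a dict of named metrics.
def compute_metrics(eval_pred: EvalPrediction):
    logits = torch.from_numpy(eval_pred.predictions)
    labels = torch.from_numpy(eval_pred.label_ids)
    loss = torch.nn.functional.cross_entropy(
        logits.reshape(-1, logits.shape[-1]),
        labels.reshape(-1),
        ignore_index=-100,  # non-masked positions are labelled -100
    )
    return {"perplexity": math.exp(loss.item())}

Is something along these lines the intended approach, or is there a simpler way to get perplexity reported after each epoch?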

Best,

Ed