How can I use evaluate's perplexity metric on a model that's already loaded?

Following the example here, I can compute perplexity for a model I have previously saved like this:

from evaluate import load

perplexity = load("perplexity", module_type="metric")
results = perplexity.compute(predictions=dataset, model_id='my-saved-model')

But this only lets me specify a model_id, which is then loaded from disk. I can’t pass an already-loaded model object, like this:

from evaluate import load
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained('my-saved-model')
perplexity = load("perplexity", module_type="metric")
results = perplexity.compute(predictions=dataset, model_id=model)

Is there an alternative way to use evaluate’s perplexity metric that doesn’t require me to point to a saved model on disk?

For larger context, I’m trying to follow the quantization pipeline here and want to use perplexity as my criterion. But this requires computing perplexity on the model as it’s being updated in memory. If evaluate’s perplexity metric is not the correct tool for this job, is there something else I should be using?
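In case it helps frame what I’d fall back to: my understanding is that perplexity is just the exponential of the mean negative log-likelihood per token, so in principle it can be computed directly from an in-memory model’s loss rather than through evaluate. A minimal sketch of that arithmetic (the helper name is mine, not from any library):

```python
import math

def perplexity_from_logprobs(token_log_probs):
    """Perplexity = exp of the mean negative log-likelihood per token."""
    nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(nll)

# Sanity check: a model that assigns uniform probability over a
# 4-token vocabulary should have perplexity 4.
print(perplexity_from_logprobs([math.log(0.25)] * 10))
```

(With a loaded transformers causal LM, I believe the same quantity falls out of `torch.exp(outputs.loss)` when `labels` are passed to the forward call, though I’m not certain that matches the striding behavior of evaluate’s metric exactly.)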