How do I evaluate a pretrained model on a test dataset?

barlen · February 24, 2022, 4:49am

Without training, how do I use a test dataset to evaluate a pretrained model?

beneyal · February 24, 2022, 5:00am

Hello

Assuming you’re using PyTorch, you can wrap your model inside a Trainer and then call trainer.evaluate(), which only runs the evaluation loop over eval_dataset and does no training. An example (taken from here):

from transformers import Trainer, TrainingArguments

# "test_trainer" is the output directory used by the Trainer
training_args = TrainingArguments("test_trainer")


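import numpy as np
from datasets import load_metric

# Note: load_metric is deprecated in newer datasets releases;
# evaluate.load("accuracy") from the evaluate library is the replacement.
metric = load_metric("accuracy")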


def compute_metrics(eval_pred):
    logits, labels = eval_pred
    predictions = np.argmax(logits, axis=-1)
    return metric.compute(predictions=predictions, references=labels)


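# `model` and the datasets are assumed to be defined already
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=small_train_dataset,  # optional if you only evaluate
    eval_dataset=small_eval_dataset,
    compute_metrics=compute_metrics,
)

# Returns a dict of metrics computed on eval_dataset
trainer.evaluate()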
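
In case it helps to see what compute_metrics is doing: it typically receives a (logits, labels) pair of NumPy arrays and takes the argmax over the class axis. Here is a minimal sketch with made-up numbers that computes accuracy directly with NumPy instead of going through load_metric. The helper name accuracy_from_logits is my own, not something from the library.

```python
import numpy as np


def accuracy_from_logits(logits, labels):
    """Fraction of rows whose highest-scoring class matches the label."""
    predictions = np.argmax(logits, axis=-1)
    return float((predictions == labels).mean())


# Toy batch: 4 examples, 3 classes (made-up numbers for illustration).
logits = np.array([
    [2.0, 0.1, -1.0],  # predicted class 0
    [0.3, 1.5, 0.2],   # predicted class 1
    [0.0, 0.2, 3.0],   # predicted class 2
    [1.2, 0.9, 0.1],   # predicted class 0
])
labels = np.array([0, 1, 2, 1])

result = {"accuracy": accuracy_from_logits(logits, labels)}
print(result)  # {'accuracy': 0.75}
```

Three of the four predictions match their labels, hence 0.75.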

If you don’t want to use a Trainer, check out the examples here and check the files ending with _no_trainer.py.
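
For a rough idea of what evaluating without a Trainer looks like, here is a minimal sketch of a plain PyTorch evaluation loop. It is not copied from the _no_trainer.py scripts. The function name evaluate_accuracy and the toy linear model and random data are placeholders I made up so the snippet runs on its own. With a real Transformers model the batch would usually be a dict of tensors and the logits would come from outputs.logits, so adapt accordingly.

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset


def evaluate_accuracy(model, dataloader, device="cpu"):
    """Run the model over the dataloader without training and return accuracy."""
    model.to(device)
    model.eval()  # disable dropout and other training-only behaviour
    correct, total = 0, 0
    with torch.no_grad():  # no gradients are needed for evaluation
        for features, labels in dataloader:
            features, labels = features.to(device), labels.to(device)
            logits = model(features)
            predictions = logits.argmax(dim=-1)
            correct += (predictions == labels).sum().item()
            total += labels.size(0)
    return correct / total


# Placeholder model and data: 20 random examples, 8 features, 3 classes.
torch.manual_seed(0)
toy_model = nn.Linear(8, 3)
toy_data = TensorDataset(torch.randn(20, 8), torch.randint(0, 3, (20,)))
toy_loader = DataLoader(toy_data, batch_size=5)

accuracy = evaluate_accuracy(toy_model, toy_loader)
print(f"accuracy: {accuracy:.2f}")
```

The two things that matter here are model.eval() and torch.no_grad(): together they make sure the weights are not updated and no gradient state is kept while you score the test set.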
