Evaluating your model on more than one dataset

konradboguslav · October 15, 2020, 8:03am

Hi,

The Transformers Trainer and TrainingArguments classes allow only one dataset for evaluation. Is there a simple way of adding another one? That way, after each epoch of training I could evaluate my model on both the training and development datasets and get the metrics for both in one output. I know I could alter training_args.py or trainer.py, but I am pretty sure I would only mess things up…

sgugger · October 15, 2020, 1:29pm

I think the easiest way to do this is to use the new TrainerCallback system and write a callback that runs an additional evaluation on your other datasets during the on_evaluate event.
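A minimal sketch of such a callback, not an official recipe. The class name ExtraEvalCallback, the extra_datasets mapping and the _running guard are hypothetical names chosen for illustration. It assumes the callback is handed a reference to the Trainer, and that Trainer.evaluate accepts eval_dataset and metric_key_prefix. The guard is there because calling evaluate from inside the callback fires the evaluation event again.

```python
try:
    from transformers import TrainerCallback
except ImportError:
    # Fallback so the sketch can be read and exercised without transformers.
    TrainerCallback = object


class ExtraEvalCallback(TrainerCallback):
    """Hypothetical callback: evaluates on extra datasets after each evaluation."""

    def __init__(self, trainer, extra_datasets):
        # trainer: the Trainer instance this callback will be attached to.
        # extra_datasets: dict mapping a short name to a dataset (assumption).
        self.trainer = trainer
        self.extra_datasets = extra_datasets
        self._running = False  # guards against re-entering on_evaluate

    def on_evaluate(self, args, state, control, **kwargs):
        if self._running:
            # This event was triggered by our own evaluate() call below.
            return
        self._running = True
        try:
            for name, dataset in self.extra_datasets.items():
                # Metrics are reported under e.g. "eval_train_loss".
                self.trainer.evaluate(
                    eval_dataset=dataset,
                    metric_key_prefix=f"eval_{name}",
                )
        finally:
            self._running = False
```

Under these assumptions it could be attached once the Trainer exists, for example with trainer.add_callback(ExtraEvalCallback(trainer, {"train": train_dataset})), where train_dataset is a placeholder for your own data. Whether the nested evaluation interacts well with options such as best-model tracking is worth checking on your installed version.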

aswinsson · January 4, 2021, 10:28am

Is it possible to provide an example?

deathcrush · February 28, 2022, 4:58pm

@sgugger, I had a brief look at the interfaces provided to achieve this, but I don’t see how it is possible. As I understand it, the TrainerCallback class receives the TrainingArguments used to initialise the Trainer, but that class does not allow us to pass additional eval dataloaders. Therefore, I am not sure how to pass the additional datasets to the Trainer in the first place.
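One possible way around this, sketched under assumptions: the extra datasets do not have to travel through TrainingArguments at all. They can live in ordinary Python objects and be passed to Trainer.evaluate one at a time. The helper evaluate_on_all below is a hypothetical name, not part of the library. It relies only on evaluate accepting eval_dataset and metric_key_prefix and returning a dict of metrics.

```python
def evaluate_on_all(trainer, datasets):
    """Hypothetical helper: evaluate on several datasets, return one metrics dict.

    trainer: any object with a Trainer-style evaluate() method.
    datasets: dict mapping a short name to a dataset.
    """
    merged = {}
    for name, dataset in datasets.items():
        # The prefix keeps keys apart, e.g. "eval_train_loss" vs "eval_dev_loss".
        metrics = trainer.evaluate(
            eval_dataset=dataset,
            metric_key_prefix=f"eval_{name}",
        )
        merged.update(metrics)
    return merged
```

A call such as evaluate_on_all(trainer, {"train": train_dataset, "dev": dev_dataset}) would then give the metrics for both sets as one output, with train_dataset and dev_dataset standing in for your own data. More recent releases of the library also document passing a dict of datasets as eval_dataset directly, so it may be worth checking the Trainer documentation for your installed version first.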
