How to define the compute_metrics() function in Trainer?


Coming from TensorFlow, I am a bit confused about how to properly define compute_metrics() in Trainer. For instance, I see various possibilities in the notebooks:

def compute_metrics(eval_pred):
    # eval_pred is a (predictions, labels) pair of NumPy arrays
    predictions, labels = eval_pred
    predictions = predictions[:, 0]  # keep only the first column of the model outputs
    # `metric` is a metric object loaded earlier in the notebook
    return metric.compute(predictions=predictions, references=labels)

My question may seem stupid (maybe it is), but how can I know how to compute the metrics if I cannot see what eval_pred looks like in Trainer? It is as if I had to guess what the output will be before actually training the model. Am I missing something here?



Hello, I’m a bit confused about your question. Are you trying to implement this on Trainer or TensorFlow?

hello @merve, thanks. I am trying to write the correct compute_metrics function for Trainer (in PyTorch). The point is that I can follow the tutorial and copy-paste the compute_metrics functions from there. However, if I want to modify them, I need to understand and play a little bit with the eval_pred output. How can I get eval_pred from a Trainer so that I can see what the predictions and labels look like?

Just run trainer.predict on your eval/test dataset.
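To expand on that a little: output = trainer.predict(eval_dataset) returns a PredictionOutput whose .predictions and .label_ids fields are the same NumPy arrays that compute_metrics later receives as the eval_pred tuple, so you can inspect their shapes directly. Here is a minimal sketch that simulates that tuple with toy arrays for a hypothetical 3-class classifier (the logits and labels below are made up, and the accuracy is computed by hand instead of via a metric object, just to show the mechanics):

```python
import numpy as np

# Simulated eval_pred for a 3-class classifier. In a real run you would get
# these arrays from output = trainer.predict(eval_dataset):
#   output.predictions -> raw logits, shape (num_examples, num_classes)
#   output.label_ids   -> gold labels, shape (num_examples,)
logits = np.array([[2.0, 0.1, -1.0],
                   [0.2, 1.5,  0.3],
                   [0.0, 0.1,  3.0],
                   [1.2, 0.9, -0.5]])
labels = np.array([0, 1, 2, 0])

def compute_metrics(eval_pred):
    predictions, labels = eval_pred
    predictions = np.argmax(predictions, axis=-1)  # logits -> predicted class ids
    accuracy = (predictions == labels).mean()      # hand-rolled accuracy
    return {"accuracy": float(accuracy)}

print(compute_metrics((logits, labels)))  # -> {'accuracy': 1.0}
```

Once you have looked at the real output.predictions from trainer.predict, you know whether you are dealing with logits, a tuple of outputs, or a single regression column, and can adapt compute_metrics accordingly.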
