How is compute_metrics working internally?

ThomasG · August 16, 2021, 2:45pm

Hi everyone, I am following this blog post Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers on fine-tuning an ASR model, and there is something I don’t understand about the compute_metrics function.

In the notebook, we want to compute the Word Error Rate for the validation set, every eval_steps steps. What is the input of this function? Does it take one batch at a time, or the whole validation dataset? If it takes one batch at a time, how is the final WER that is diplayed calculated? Is it the mean of all the WERs of all the batches?

Pinging the author of this blog post as well @patrickvonplaten . I’d appreciate any insight you guys can give me.

Thanks in advance.

sgugger · August 30, 2021, 6:10pm

The compute_metrics function takes the predictions and labels over the whole evaluation dataset and computes the metrics from them.

ThomasG · August 30, 2021, 7:53pm

Thank you.

Topic		Replies	Views
Input of compute_metrics in ASR model Beginners	2	1324	April 19, 2021
Different wer using Trainer.evaluate() and a loop with metric.add_batch Beginners	0	569	February 8, 2022
How is the eval dataset processed in a trainer? 🤗Transformers	0	511	August 28, 2023
Compute_metrics do not find tokenizer (whisper finetuning) 🤗Transformers	1	324	March 6, 2024
Code review: compute_metrics for WER with Wav2Vec2ProcessorWithLM 🤗Transformers	4	1034	April 19, 2022

How is compute_metrics working internally?

Related topics