Input of compute_metrics in ASR model

jolurf · April 17, 2021, 6:09pm

What should be the input of the function below?
Is it the model or a pass forward of the model?

def compute_metrics(pred):
    pred_logits = pred.predictions
    pred_ids = np.argmax(pred_logits, axis=-1)

    pred.label_ids[pred.label_ids == -100] = processor.tokenizer.pad_token_id

    pred_str = processor.batch_decode(pred_ids)
    # we do not want to group tokens when computing the metrics
    label_str = processor.batch_decode(pred.label_ids, group_tokens=False)

    wer = wer_metric.compute(predictions=pred_str, references=label_str)

    return {"wer": wer}

Also, I have one question about how the Trainer class works…
I mean, it encloses everything, but does it update the weights of the model automatically or it creates another instance for model?

sgugger · April 19, 2021, 12:57pm

The input is a namedtuple of type EvalPrediction. It should have one key predictions for the predictions (will have the structure as the output of your model, so one tensor if your model outputted one tensor, a tuple of two tensors if that’s what your model returns, et.) and one key label_ids that will contain all the labels.

The Trainer does update the weights of the model, otherwise, the model would not be… well… training

jolurf · April 19, 2021, 1:23pm

I didn’t get the part of trainer not updating the weights of the model… because training would be about updating the weights on the hidden layers after the transformers and before the output… so what exactly does it do? Also, after using it, I cannot use the model to predict results anymore

Topic		Replies	Views
Custom model for Trainer 🤗Transformers	1	388	July 8, 2023
How to define the compute_metrics() function in Trainer? 🤗Transformers	3	16673	December 20, 2021
Trainer doesn't get to compute_metrics after upgrading to v4.32 🤗Transformers	4	1459	July 2, 2024
How is compute_metrics working internally? Beginners	2	822	August 30, 2021
Compute_metrics do not find tokenizer (whisper finetuning) 🤗Transformers	1	324	March 6, 2024

Input of compute_metrics in ASR model

Related topics