How is the eval dataset processed in a trainer?

Hi, I’m currently trying to compute metrics for a SpeechT5 model on an ASR task, but when I run evaluation with my compute_metrics function I get the following error: "Sizes of tensors must match except in dimension 0. Expected size 216 but got size 231 for tensor number 1 in the list." My compute_metrics function and my Seq2SeqTrainer look like this:

trainer = Seq2SeqTrainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,
    eval_dataset=valid_dataset,
    compute_metrics=compute_metrics,
    data_collator=data_collator,
    tokenizer=processor,
)

def compute_metrics(predictions):

    print("metrics")

    # predictions is an EvalPrediction with .predictions and .label_ids
    pred_ids = predictions.predictions
    label_ids = predictions.label_ids

    print(pred_ids)
    print(label_ids)

    # label_ids[label_ids == -100] = processor.tokenizer.pad_token_id

    pred_str = processor.batch_decode(pred_ids, skip_special_tokens=True)
    label_str = processor.batch_decode(label_ids, skip_special_tokens=True)

    # metric is the WER metric (e.g. evaluate.load("wer")), loaded elsewhere
    wer = 100 * metric.compute(predictions=pred_str, references=label_str)

    # f1_metric = f1_score(label_str, pred_str)

    return {"wer": wer}

I have read about similar errors people had on the forum, but I’m not sure I understood most of the answers correctly. I have a feeling that the eval_dataset is not processed through the data collator and therefore my eval batches are not padded. Some answers said it has to do with the padding length: I currently use the "longest" padding strategy, and several people suggested switching to "max_length". However, I am using a processor to pad, and my audio inputs are much longer than my text labels, so padding everything to the maximum length causes CUDA out-of-memory errors. Is there another way to get rid of this error effectively? Any help is appreciated!
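
For context, the padding in my data collator looks roughly like this (a simplified sketch of the usual speech seq2seq collator pattern rather than my exact code; the class name and the input_values/labels field names are assumptions based on my setup):

from dataclasses import dataclass
from typing import Any, Dict, List

import torch


@dataclass
class DataCollatorSpeechSeq2SeqWithPadding:
    # processor wraps the feature extractor (audio) and the tokenizer (text)
    processor: Any

    def __call__(self, features: List[Dict[str, Any]]) -> Dict[str, torch.Tensor]:
        # pad the raw audio inputs to the longest example in the batch
        input_features = [{"input_values": f["input_values"]} for f in features]
        batch = self.processor.feature_extractor.pad(
            input_features, padding="longest", return_tensors="pt"
        )

        # pad the tokenized transcriptions to the longest label in the batch
        label_features = [{"input_ids": f["labels"]} for f in features]
        labels_batch = self.processor.tokenizer.pad(
            label_features, padding="longest", return_tensors="pt"
        )

        # replace padding token ids with -100 so they are ignored by the loss
        labels = labels_batch["input_ids"].masked_fill(
            labels_batch["attention_mask"].ne(1), -100
        )
        batch["labels"] = labels
        return batch

If I understand correctly, with "longest" each batch is only padded to the longest sequence in that batch, so the label tensors coming out of different eval batches can end up with different lengths, which seems to match the 216 vs. 231 mismatch in the error.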