Different classification label produced by 'predictions' and 'label_ids' from Trainer.predict()

What does predictions and label_ids actually mean from Trainer.predict()?

I trained a multilabel classification model and evaluated it on a test dataset. The metrics returned by Trainer.predict() gave me the output below:


I thought label_ids were the predicted labels, so I computed a confusion matrix between label_ids and my test labels. The result showed a perfect prediction: accuracy = 1, recall = 1, precision = 1, etc.

I realized something was wrong, so I computed the labels myself from the logits in the predictions returned by Trainer.predict():

import tensorflow as tf
compute_labels = tf.round(tf.nn.sigmoid(test_prediction.predictions))

Running a confusion matrix between compute_labels and the test labels, I got reasonable prediction results that replicated the metrics reported by Trainer.predict() (i.e., the image above).
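For reference, the sigmoid-and-round step can be sketched without TensorFlow, using NumPy on synthetic logits (the array shape and values here are made up, standing in for test_prediction.predictions):

```python
import numpy as np

def logits_to_multilabel(logits, threshold=0.5):
    """Turn raw multilabel logits into 0/1 label vectors.

    Each output unit is an independent binary decision, so we apply a
    sigmoid per unit and threshold it -- no argmax across classes.
    """
    probs = 1.0 / (1.0 + np.exp(-np.asarray(logits, dtype=float)))
    return (probs >= threshold).astype(int)

# Synthetic logits for 3 samples x 4 labels (stand-in for
# test_prediction.predictions).
logits = np.array([
    [ 2.0, -1.0,  0.5, -3.0],
    [-0.2,  1.5, -1.0,  0.1],
    [ 3.0,  3.0, -2.0, -2.0],
])
print(logits_to_multilabel(logits))
# [[1 0 1 0]
#  [0 1 0 1]
#  [1 1 0 0]]
```

These 0/1 vectors are what you compare against the true labels when building a multilabel confusion matrix.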

My question is: what does label_ids from Trainer.predict() actually mean? Why did the output I got from this attribute produce perfect prediction results, which is obviously wrong?

Thank you in advance

Hello, I too naïvely thought that Trainer.predict.label_ids were the predicted labels. But they are the actual (ground-truth) labels, as we can read in this tutorial. You need to retrieve the predicted labels yourself, e.g. with np.argmax(test_prediction.predictions, axis=-1) for single-label classification (for your multilabel setup, the sigmoid-and-round approach you used is the appropriate one). As we can read later on in the tutorial, metrics are computed like so:

metric = load_metric("glue", "mrpc")
metric.compute(predictions=preds, references=predictions.label_ids)
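A minimal sketch of what that comparison amounts to in the single-label case, with synthetic logits standing in for the fields returned by Trainer.predict() (the variable names mirror the tutorial, not a real Trainer run):

```python
import numpy as np

# Stand-ins for the output of Trainer.predict():
# .predictions holds raw logits, .label_ids holds the ground truth.
logits = np.array([
    [0.1, 2.3],   # argmax -> class 1
    [1.8, -0.4],  # argmax -> class 0
    [0.2, 0.9],   # argmax -> class 1
])
label_ids = np.array([1, 0, 0])  # actual labels, NOT predictions

# Predicted labels come from the logits, never from label_ids.
preds = np.argmax(logits, axis=-1)
accuracy = float(np.mean(preds == label_ids))
print(preds, accuracy)  # [1 0 1] with accuracy 2/3
```

Comparing label_ids with itself (as in the original question) trivially yields accuracy = 1, which is why the confusion matrix looked perfect.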

Which confirms that label_ids are indeed the actual labels. I also find the naming very confusing.