Confusion about trainer.predict(dataset['test']) output

SiddharthaM · November 3, 2022, 10:15pm

Dear everyone,
Hello, im learning how to fine-tune a Transformer model.
We predict the outputs of a fine-tuned model using predictions = trainer.predict(dataset[‘test’]). Is predictions.predictions the output logits of the Fine-Tuned Transformer model? or is it something else?

And to calculate the output probabilities of the model, im using the following code

import tensorflow as tf
predictions = trainer.predict(dataset["test"])
prediction_proba = tf.math.softmax(predictions.predictions, axis=-1)

Is it the correct method to find out the prediction probabilities of a model’s output?

Topic		Replies	Views
Trainer.predict return predictions=None Beginners	1	215	April 10, 2024
Returning logits from Trainer.predict() Beginners	3	2369	August 31, 2021
What does the output of Seq2SeqTrainer predict.predictions refer to and how to get generated summaries Beginners	4	1256	October 19, 2023
Transform Logits to probabilities doesn't work Beginners	4	9372	February 17, 2022
Getting outputs of mode.predict() per sentence input Models	3	2436	June 21, 2021

Confusion about trainer.predict(dataset['test']) output

Related topics