Dear everyone,
Hello, im learning how to fine-tune a Transformer model.
We predict the outputs of a fine-tuned model using predictions = trainer.predict(dataset[‘test’]). Is predictions.predictions the output logits of the Fine-Tuned Transformer model? or is it something else?
And to calculate the output probabilities of the model, im using the following code
import tensorflow as tf
predictions = trainer.predict(dataset["test"])
prediction_proba = tf.math.softmax(predictions.predictions, axis=-1)
Is it the correct method to find out the prediction probabilities of a model’s output?