Make predictions with the Dropout on

MrRobot · March 27, 2021, 5:59am

The default behavior of Trainer(...) when evaluating model is disabling Dropout. Concretely, y_pred for M runs will be exactly the same

for i in range(M):
    logits, labels, metrics = trainer.predict(tokenized_datasets["eval"])
    y_pred = np.argmax(logits, axis=2)
    ...

Now I am trying to apply Monte Carlo Dropout trick introduced this this answer. This requires to turn the Dropout on while making predictions on the validation set.

I am wondering how I achieve this goal. Any input is appreciated

Yuti · July 17, 2021, 4:42pm

I don’t think this is possible with the Trainer class as it is, but you can derive this class and then change the relevant methods.

In your case, I think you need to change the evaluation_loop method and delete the model.eval() line.

I think it would be better to keep the model.eval() line and set only the Dropout layer to train mode like it is shown in this post.

There might be some more changes that you need to make.
You can find the Trainer source code here.

Hope this helps .

Topic		Replies	Views
Saving Models in Active Learning setting 🤗Transformers	1	635	December 6, 2022
Is model.eval() equivalent to setting dropout as 0? 🤗Transformers	0	1329	July 7, 2022
How to set up Trainer for a regression? 🤗Transformers	6	13936	April 13, 2024
Adding dropout in custom model, but setting dropout through .from_pretrained() 🤗Transformers	2	61	March 21, 2025
RewardTrainer Problem 🤗AutoTrain	6	188	February 1, 2025

Make predictions with the Dropout on

Related topics