Loading a model from local with best checkpoint

sgugger · October 30, 2020, 4:26pm

I don’t understand the question. With load_best_model_at_end the model loaded at the end of training is the one that had the best performance on your validation set. So when you save that model, you have the best model on this validation set.

If it’s crap on another set, it means your validation set was not representative of the performance you wanted and there is nothing we can do on Trainer to fix that.

Topic		Replies	Views
Question Regarding trainer arguments:: load_best_model_at_end Beginners	2	1948	April 19, 2021
Do trainer.save_model saves the best model? 🤗Transformers	3	6362	July 3, 2023
Does checkpoint have memory in the case of resume from checkpoint Beginners	0	222	February 28, 2024
Checkpoint vs model weight Beginners	2	4775	October 12, 2020
Differences in prediction from train end to checkpoint Beginners	3	837	September 11, 2023

Loading a model from local with best checkpoint

Related topics