I’m trying to solve a doubt that I saw a few other people have asked about, but couldn’t find an answer.
The question is quite simple: when using
trainer.push_to_hub() together with the
load_best_model_at_end argument, is the trainer pushing the last or the best model?
My doubt is raised by the fact that the automatically-created model card reports the selected score metric for the last epoch, instead of the one obtained at the best one.
I am reasonably sure is not an issue of loading the best model, since I’ve tested via
trainer.evaluate() that after training the trainer is, in fact using the best-obtained model.
Thank you so very much for any help/clarification, and the great work you’re all doing!