You can’t use load_best_model_at_end=True
if you don’t want to save checkpoints: it needs to save checkpoints at every evaluation to make sure you have the best model, and it will always save 2 checkpoints (even if save_total_limit
is 1): the best one and the last one (to resume an interrupted training).
11 Likes