Early stopping + trainer + hub

silvia-casola · October 31, 2023, 2:52pm

Hi! I am trying to fine-tune a model with early stopping using trainer and then publish it on the hub. However, from the automatically created model card, it looks like the updated model is the last one and not the best one.

See, for example, this model.

The arguments I use are:

training_args = TrainingArguments(
        output_dir="irony_"+language+"_"+"_".join(nationality.split()),          
        num_train_epochs=20,  
        learning_rate = 5e-06,
        per_device_train_batch_size=16,  
        per_device_eval_batch_size=64,   
        evaluation_strategy="epoch",
        save_strategy="epoch",
        save_total_limit=2,
        logging_strategy="epoch",
        overwrite_output_dir = True,
        load_best_model_at_end = True,
    )

I then train using the trainer, and push to the hub using

    trainer.push_to_hub("irony_"+language+"_"+"_".join(nationality.split()))

I use transformers 4.34.1

lysandre · November 1, 2023, 9:11am

Thanks for your question @silvia-casola!

Pinging @muellerzr and @smangrul for advice

silvia-casola · November 6, 2023, 12:38pm

Hi @muellerzr @smangrul , any advice?

Detsutut · January 17, 2024, 3:15pm

The trainer pushes the best model, but the automatically-created card reports the performance of the last one.

Check this: Clarification on push_to_hub, best model, and model card

Topic		Replies	Views
Autogenerated model cards not showing the best metrics when using "load_best_model_at_end=True" 🤗Hub	0	531	December 24, 2022
How to push save trainer + model to hub? 🤗Hub	0	1500	June 13, 2022
Clarification on push_to_hub, best model, and model card 🤗Hub	3	1547	January 2, 2025
Early Stopping saving second best model, not first Beginners	0	437	August 16, 2023
Unexpected behavior of load_best_model_at_end in Trainer (or am I doing it wrong?) 🤗Transformers	2	58	March 25, 2025

Early stopping + trainer + hub

Related topics