Hi there,
I am quite confused about the early_stopping_patience in EarlyStoppingCallback.
Is it related to the evaluation_strategy in TrainingArguments?
For example, when evaluation_strategy='epoch' and early_stopping_patience=8, will training stop if the metric/loss does not improve for 8 consecutive epochs? And does it work the same way when evaluation_strategy='steps'?
EarlyStoppingCallback is related to evaluation_strategy and metric_for_best_model. From the docs:

- early_stopping_patience (int) — Use with metric_for_best_model to stop training when the specified metric worsens for early_stopping_patience evaluation calls.
I was also confused about whether to use it with evaluation_strategy='steps' or 'epoch', but after some trials I realized it is better to use 'epoch', to guarantee that the model is trained on the whole dataset between evaluations.
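To make the "evaluation calls" wording concrete, here is a minimal sketch (not the Hugging Face implementation, and it ignores the early_stopping_threshold argument) of how patience counts evaluation events, whichever strategy triggers them:

```python
# Sketch of patience counting: one entry per evaluation event,
# regardless of whether evaluations fire per epoch or per eval_steps.

def should_stop(metric_history, patience, greater_is_better=False):
    """Return True once the metric has failed to improve on the best
    value for `patience` consecutive evaluations."""
    best = None
    bad_evals = 0
    for value in metric_history:
        improved = (
            best is None
            or (value > best if greater_is_better else value < best)
        )
        if improved:
            best = value
            bad_evals = 0
        else:
            bad_evals += 1
            if bad_evals >= patience:
                return True
    return False

# With evaluation_strategy="epoch", each entry is one epoch's eval metric;
# patience=2 stops after 2 epochs without improvement:
print(should_stop([0.9, 0.8, 0.85, 0.82], patience=2))  # True
# With evaluation_strategy="steps" and eval_steps=500, the same patience=2
# corresponds to 2 * 500 = 1000 training steps without improvement.
```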
If you use early_stopping_patience in EarlyStoppingCallback, you must:

- Pass a function that returns an evaluation dict to the compute_metrics param of the Trainer class.
- Use metric_for_best_model to set the evaluation key from compute_metrics, e.g. mae, mse…
- Use greater_is_better to specify whether a greater or lower value of that metric is better. For mae or mse, lower is better.
My code:
```python
from sklearn.metrics import mean_squared_error, mean_absolute_error
from transformers import Trainer, TrainingArguments, EarlyStoppingCallback

def compute_metrics(eval_pred):
    predictions, labels = eval_pred
    predictions = predictions[:, 0]
    mse = mean_squared_error(labels, predictions)
    mae = mean_absolute_error(labels, predictions)
    return {"mse": mse, "mae": mae}

training_args = TrainingArguments(
    output_dir=f"{model_path.split('/')[-1]}_regression_finetuned_{output_name}",
    evaluation_strategy="epoch",
    save_strategy="epoch",
    save_total_limit=2,
    learning_rate=3e-5,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    num_train_epochs=10,
    weight_decay=0.01,
    load_best_model_at_end=True,
    metric_for_best_model="mae",
    greater_is_better=False,
    warmup_steps=warmup_steps,
    lr_scheduler_type="cosine",
    logging_dir="./logs",
    logging_steps=50,
    push_to_hub=True,
    run_name="run_cosine_decay_regression",
    fp16=False,
    report_to="none",
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_dataset["train"],
    eval_dataset=tokenized_dataset["test"],
    compute_metrics=compute_metrics,
    tokenizer=tokenizer,
    data_collator=data_collator,
    callbacks=[EarlyStoppingCallback(early_stopping_patience=2)],
)
```