I am fine-tuning a BERT model for a multiclass classification problem. During training, my losses look a bit “unhealthy”: my validation loss is always smaller than my training loss (eval_steps=20). How can I plot a loss curve with a Trainer() model?
Scott from Weights & Biases here. I don’t want to be spammy, so I’ll delete this if it’s not helpful. You can log the losses to W&B by passing report_to="wandb" to TrainingArguments.
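For example (a quick sketch; everything apart from report_to is just an illustrative value):

from transformers import TrainingArguments

# report_to="wandb" sends the Trainer's logs (train/eval loss etc.) to W&B.
# output_dir and the step settings below are only example values.
training_args = TrainingArguments(
    output_dir="out",
    evaluation_strategy="steps",
    eval_steps=20,
    logging_steps=20,
    report_to="wandb",
)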
Hey Scott,
I think it’s helpful, but I already do that. Anyway, I want to find a way to plot the losses directly in my notebook… Any idea how to achieve that? Cheers
Note that the validation loss being smaller than the training loss is not necessarily bad or weird when working with advanced architectures and techniques, since you are not really comparing equivalent things. For example, consider dropout, which “cancels” some connections during training while using all of them during evaluation (validation).
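As a quick toy illustration of that in PyTorch:

import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)
x = torch.ones(8)

drop.train()   # training mode: roughly half the values are zeroed, the rest are scaled up
print(drop(x))

drop.eval()    # evaluation mode: dropout is a no-op, every value passes through unchanged
print(drop(x))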
I trained a few other BERT models, and it seems that all of them need a few steps (up to 50) until the training loss becomes lower than the validation loss, even with different random states etc. Do you think I don’t really have to worry? After those “starting problems” the losses look normal/healthy for my taste (0.3 vs 0.6 when finished with early stopping).
I obviously can’t say! But the fact that the validation loss is lower than the training loss would not be a big concern to me! How those losses evolve seems more important. And of course, if the model’s performance actually improves over time, that’s even more relevant! (You can see this in downstream tasks if you are training a language model.)
Hey scottire, is it possible to obtain the training metrics and load them into a pandas DataFrame? I’m looking to plot these scores in matplotlib so that I can compare them with models trained in other frameworks.
Also, when using wandb, is there a way to view the plot against epochs rather than steps?
import pandas as pd
import wandb

api = wandb.Api()
entity, project = "<entity>", "<project>"  # set to your entity and project
runs = api.runs(entity + "/" + project)

summary_list, config_list, name_list = [], [], []
for run in runs:
    # .summary contains the output keys/values for metrics like accuracy.
    # We call ._json_dict to omit large files
    summary_list.append(run.summary._json_dict)

    # .config contains the hyperparameters.
    # We remove special values that start with _.
    config_list.append(
        {k: v for k, v in run.config.items()
         if not k.startswith('_')})

    # .name is the human-readable name of the run.
    name_list.append(run.name)

runs_df = pd.DataFrame({
    "summary": summary_list,
    "config": config_list,
    "name": name_list
})
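Note that run.summary only holds the latest/final value of each metric, so the DataFrame above won’t give you the full loss curve. To plot the curve (e.g. against epochs rather than steps), you can pull a run’s logged history as a pandas DataFrame with run.history(). A rough sketch; the column names ("train/loss", "eval/loss", "train/epoch") are assumptions and depend on what your Trainer integration actually logged, so check history_df.columns first:

import matplotlib.pyplot as plt

run = api.run(entity + "/" + project + "/<run_id>")  # <run_id> is a placeholder for one of your run ids
history_df = run.history()  # (sampled) step-wise metrics as a pandas DataFrame

# Assumed column names; inspect history_df.columns for your own runs.
train = history_df[["train/epoch", "train/loss"]].dropna()
evals = history_df[["train/epoch", "eval/loss"]].dropna()

plt.plot(train["train/epoch"], train["train/loss"], label="train loss")
plt.plot(evals["train/epoch"], evals["eval/loss"], label="eval loss")
plt.xlabel("epoch")
plt.ylabel("loss")
plt.legend()
plt.show()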