Weights and Biases not showing train loss correctly

Hi all, I’m training a binary text classification model. To debug, I’m training and evaluating on a small subset of the data (around 16 data points) to see if the model can successfully overfit. However, the train_loss logged to Weights and Biases is not showing correctly – as you can see from the screenshot, it’s just a single point. Any idea why this is happening?

Below is my training code:

from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("roberta-large")
model = AutoModelForSequenceClassification.from_pretrained("roberta-large")

training_args = TrainingArguments(
    output_dir='./results',
    learning_rate=1e-3,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=4,
    num_train_epochs=5,
    evaluation_strategy="epoch",
    logging_steps=1,
    # weight_decay=0.01,
)


trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=encoded_ds,
    eval_dataset=encoded_ds,
    tokenizer=tokenizer,
    compute_metrics=compute_metrics,
    # data_collator=data_collator,
)
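
(For reference, a minimal compute_metrics for a binary classification setup like this would look something like the sketch below; the exact metric shouldn’t matter for the logging issue.)

import numpy as np

def compute_metrics(eval_pred):
    # eval_pred carries the logits and the true labels for the eval set
    logits, labels = eval_pred
    preds = np.argmax(logits, axis=-1)
    return {"accuracy": float((preds == labels).mean())}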

What happens if you drop your batch size (both train and eval) to 1? And set

max_steps=16,
evaluation_strategy="steps"

Not sure if it will fully fix things, but it might help diagnose what is going on.
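
Something along these lines (just a sketch; eval_steps is my guess so it evaluates every step):

training_args = TrainingArguments(
    output_dir="./results",
    learning_rate=1e-3,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    max_steps=16,                   # overrides num_train_epochs
    evaluation_strategy="steps",
    eval_steps=1,                   # evaluate every step
    logging_steps=1,                # log the training loss every step
)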

Hey! After doing some digging, I think you just need to scroll through the nine panels available (only six of the nine are displayed in your screenshot) and you’ll find the actual training loss under train/loss rather than train/train_loss. W&B logs the various things the Hugging Face Trainer sends it, and train/train_loss appears to be a single summary value sent at the end of training, which is why it just looks like a dot. Let me know if that solves it or if you have other questions 🙂
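
If you want to sanity-check it outside of W&B, the same values also end up in trainer.state.log_history, so you can print them after training (a quick sketch):

trainer.train()

# Entries with a "loss" key are the per-step training losses (what W&B shows
# as train/loss); the final entry carries "train_loss", the single summary
# value logged at the end of training, which is why train/train_loss is a dot.
for entry in trainer.state.log_history:
    if "loss" in entry:
        print(entry["step"], entry["loss"])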
