Hello
I am fine-tuning a foundation model on language modeling (next-token prediction). As you can see in my code below, evaluation is performed every 5 steps:
import transformers

train_args = transformers.TrainingArguments(
    output_dir=output_dir,
    warmup_steps=1,
    per_device_train_batch_size=2,
    gradient_accumulation_steps=1,
    max_steps=max_steps,
    learning_rate=2.5e-5,
    evaluation_strategy="steps",
    eval_steps=5,
)
trainer = transformers.Trainer(
    model=model,
    train_dataset=train,
    eval_dataset=eval,
    args=train_args,
    data_collator=data_collator,
)
trainer.train()
The training time for each step is normal, but each evaluation step is very long!
Would you have any idea why it takes so long?
Thank you