I observe a very long delay before training actually starts once `Trainer.train` is called. It appears to come from `LengthGroupedSampler`, which is used when `group_by_length` is set to `True`.
Is there a way to use multiple workers to accelerate this process?
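One workaround, assuming the delay comes from the sampler computing per-example lengths in the main process: precompute the lengths yourself with multiple workers and store them in the dataset under the column named by `TrainingArguments.length_column_name` (default `"length"`); when that column exists, the Trainer reads it directly instead of having `LengthGroupedSampler` recompute lengths. A minimal sketch using stdlib threads on a toy stand-in for tokenized data (with `datasets`, `Dataset.map(..., num_proc=N)` achieves the same with processes):

```python
from concurrent.futures import ThreadPoolExecutor

# Toy stand-in for a tokenized dataset; in practice these are your input_ids.
input_ids = [[1, 2, 3], [1], [1, 2]]

# Precompute per-example lengths with multiple workers up front, so the
# sampler does not have to iterate over the whole dataset at train start.
with ThreadPoolExecutor(max_workers=4) as pool:
    lengths = list(pool.map(len, input_ids))

print(lengths)  # [3, 1, 2]
```

The resulting list can then be attached as a `"length"` column (e.g. via `Dataset.add_column`) before calling `Trainer.train`.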