Hugging Face Forums
How to run an end to end example of distributed data parallel with hugging face's trainer api (ideally on a single node multiple gpus)?
Intermediate
aalexandrov
September 6, 2023, 4:39pm
18
@cyt79
Have you solved this, I am encountering the same phenomenon as well?
show post in topic
Related topics
Topic
Replies
Views
Activity
Distributed training large models on cloud resources
Beginners
6
859
March 27, 2024
Single Node Multi GPU FlanT5 fine-tuning using HF Dataset and HF Trainer
🤗Transformers
4
2089
July 5, 2023
Using Transformers with DistributedDataParallel — any examples?
Intermediate
11
23783
May 8, 2023
Multi gpu training
🤗Transformers
3
6080
April 24, 2022
Training using multiple GPUs
Beginners
20
20271
February 25, 2024