Hugging Face Forums
torch.distributed.elastic.multiprocessing.errors.ChildFailedError
🤗Transformers
ekjot1999
January 12, 2023, 5:41pm
5
Hi @IdoAmit198, I'm facing the same issue. Have you resolved it?
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: -9) local_rank: 3 (pid: 10561) of binary
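For anyone reading the log line above: a negative exit code from a Python-managed child process conventionally means the process was terminated by a signal with that number, so `-9` would correspond to signal 9 (SIGKILL on Linux). On Linux this is often, though not always, the kernel's out-of-memory killer acting on host RAM, which the system log may confirm. The snippet below is only a minimal sketch for decoding such codes; `describe_exit_code` is a hypothetical helper name, not part of PyTorch, and it assumes the launcher follows the usual negative-means-signal convention.

```python
import signal


def describe_exit_code(exitcode: int) -> str:
    """Turn a child-process exit code into a short human-readable description.

    Hypothetical helper for illustration; assumes the convention that a
    negative exit code means "terminated by signal number -exitcode".
    """
    if exitcode < 0:
        signum = -exitcode
        try:
            # signal.Signals maps a signal number to its symbolic name.
            name = signal.Signals(signum).name
        except ValueError:
            # The number is not a known signal on this platform.
            name = "unknown signal"
        return f"killed by signal {signum} ({name})"
    if exitcode == 0:
        return "exited normally"
    return f"exited with status {exitcode}"


# The exit code reported for local_rank 3 in the log line above.
print(describe_exit_code(-9))
```

If the result is SIGKILL, checking host memory usage while the job runs is a reasonable first step before looking at GPU-side causes.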