Hugging Face Forums
torch.distributed.elastic.multiprocessing.errors.ChildFailedError
🤗Transformers
ekjot1999
January 12, 2023, 5:41pm
5
hi
@IdoAmit198
, i’m facing the same issue, have u resolved this issue?
1 Like
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: -9) local_rank: 3 (pid: 10561) of binary
show post in topic
Related topics
Topic
Replies
Views
Activity
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
🤗Accelerate
1
541
August 15, 2024
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: -9) local_rank: 3 (pid: 10561) of binary
🤗Accelerate
4
4735
January 24, 2024
Errors when training on multi node single gpu
🤗Transformers
1
1727
February 25, 2022
Multi-GPU Distributed Training using Accelerate on Windows
🤗Accelerate
0
1520
August 9, 2023
RAM memory issues while training with torch.distributed.launch
🤗Transformers
1
1008
October 19, 2022