Hugging Face Forums
torch.distributed.elastic.multiprocessing.errors.ChildFailedError
🤗Transformers
ekjot1999
January 12, 2023, 5:41pm
5
hi
@IdoAmit198
, i’m facing the same issue, have u resolved this issue?
1 Like
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: -9) local_rank: 3 (pid: 10561) of binary
show post in topic
Related topics
Topic
Replies
Views
Activity
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
🤗Accelerate
1
676
August 15, 2024
Run crash with all GPU's and success with less
🤗Transformers
0
423
December 12, 2022
Error when fine-tuning on multi-gpu
🤗Transformers
1
787
February 17, 2025
RuntimeError: arguments are located on different GPUs
🤗Transformers
2
1869
October 24, 2020
Errors when training on multi node single gpu
🤗Transformers
1
1776
February 25, 2022