Hugging Face Forums
DeepSpeed error: a leaf Variable that requires grad is being used in an in-place operation
🤗Transformers
DeepSpeed
SSamDav
July 26, 2024, 3:27pm
2
I only get this error when I use
process_group_backend="gloo",
show post in topic
Related topics
Topic
Replies
Views
Activity
Gpt-neo inference with Deepspeed: IndexError: Dimension out of range
Beginners
0
483
August 10, 2021
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)
DeepSpeed
5
3473
August 26, 2024
Question about using trainer with DeepSpeed
🤗Transformers
0
462
April 25, 2023
How to use trainer with deepspeed
Beginners
0
342
January 12, 2024
[Deepspeed] ZeRO-Infinity integration released and config changes
DeepSpeed
2
2302
April 28, 2021