Hugging Face Forums
Inflated GPU memory footprint of model prepared via accelerate
🤗Accelerate
varadhbhatnagar
September 15, 2023, 4:40am
5
@tongyx361
Check this out :
Data Parallel Multi GPU Inference
1 Like
show post in topic
Related Topics
Topic
Replies
Views
Activity
`Accelerator.prepare` utilize only one GPU instead of all the 8 available GPUs and raises "CUDA out of memory"
🤗Accelerate
2
1638
July 13, 2023
Data Parallel Multi GPU Inference
🤗Accelerate
9
2902
September 15, 2023
Accelerator OOM
🤗Accelerate
2
769
July 5, 2023
FSDP accelerate.prepare gives OOM. How to load model into single GPU, then distribute shards?
🤗Accelerate
2
501
January 24, 2024
How to load large model with multiple GPU cards?
Beginners
8
22026
October 25, 2023