Difference between enable_model_cpu_offload and device_map

khayamgondal · June 24, 2024, 1:26am

What's the difference between the two? My understanding is that with device_map="auto", the GPU is used first if one is available, and if the model is too large to fit, whatever is left once the GPU fills up is offloaded to the CPU. Does enable_model_cpu_offload achieve the same goal?
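To make the "auto" behaviour I'm describing concrete, here is a toy sketch in plain Python. It is only a model of my understanding, not the actual placement algorithm used by Accelerate: the function name `toy_device_map`, the layer names, and the sizes are all made up for illustration.

```python
def toy_device_map(layer_sizes, gpu_capacity):
    """Toy model of GPU-first placement (hypothetical, not the Accelerate algorithm).

    Layers are assigned in order to "cuda:0" until the next layer would
    exceed gpu_capacity; that layer and every later one go to "cpu".
    """
    placement = {}
    used = 0
    spilled = False
    for name, size in layer_sizes.items():
        if not spilled and used + size <= gpu_capacity:
            placement[name] = "cuda:0"
            used += size
        else:
            # Once one layer does not fit, keep the rest on the CPU as well.
            spilled = True
            placement[name] = "cpu"
    return placement


# Made-up layer sizes (arbitrary units) and a GPU budget of 7 units.
layers = {"embed": 2, "block.0": 4, "block.1": 4, "lm_head": 2}
print(toy_device_map(layers, gpu_capacity=7))
# {'embed': 'cuda:0', 'block.0': 'cuda:0', 'block.1': 'cpu', 'lm_head': 'cpu'}
```

In this sketch the split is decided once, up front, and each layer stays where it was placed. What I'm unsure about is whether enable_model_cpu_offload works the same way or does something different.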
