I’ve recently been working with large models and discovered that passing device_map="auto"
is an easy way to spread a large model across GPU devices at load time, without having to
call .to() myself or write much other configuration.
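For context, my understanding is that a device map is just a dictionary mapping module-name prefixes to devices. Here’s a minimal sketch of a hand-written one — the module names below follow GPT-2’s layout and are my assumption; substitute the names from your own model:

```python
# A device_map assigns each top-level submodule to a device:
# an integer GPU index, "cpu", or "disk" for offloading.
device_map = {
    "transformer.wte": 0,      # embeddings on GPU 0
    "transformer.h.0": 0,      # first block on GPU 0
    "transformer.h.1": "cpu",  # overflow block offloaded to CPU
    "lm_head": "disk",         # rarely-touched weights offloaded to disk
}

# Sketch of how it would be passed in place of "auto"
# (commented out since it downloads weights):
# model = AutoModelForCausalLM.from_pretrained("gpt2", device_map=device_map)
```

If that’s roughly right, then "auto" presumably just computes such a dictionary for you from the available memory — but that’s exactly the part I’d like to see documented.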
I want to learn a bit more about how to design device maps, but the documentation doesn’t seem to cover it (I tried following the hyperlink from “Handling Big Models for Inference”, but it leads to a 404 page).
I also tried reading the source code for modeling_utils.PreTrainedModel.from_pretrained,
but I’m having trouble getting a good grasp of it from the code alone.
Is there anywhere else I could look to learn more about how device mapping works?