Using 2 GPUs out of 4

Hi,

I'm doing inference with Falcon-40B.
I have 4 GPUs on my server.
When I do: model = AutoModelForCausalLM.from_pretrained(model_id, cache_dir='./workspace/',
torch_dtype=torch.bfloat16, trust_remote_code=True, device_map="auto", offload_folder="offload")
it uses all 4 GPUs.
When I do: model = AutoModelForCausalLM.from_pretrained(model_id, cache_dir='./workspace/',
torch_dtype=torch.bfloat16, trust_remote_code=True, device_map={"": 0}, offload_folder="offload")
it uses only 1 GPU.

Now I don't know the syntax of device_map to tell it to use only 2 GPUs (0 and 1).
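From what I understand of the Accelerate docs, device_map="auto" can be combined with a max_memory dict that caps how much each device may hold, so setting GPUs 2 and 3 to zero should confine the model to GPUs 0 and 1. Here is a minimal sketch of what I'm considering (the 40GiB limits are just a guess for my cards, not something I've verified):

import torch
from transformers import AutoModelForCausalLM

# Cap GPUs 2 and 3 at zero so the automatic device map only
# places weights on GPUs 0 and 1 (adjust the limits to the
# actual VRAM of the cards).
max_memory = {0: "40GiB", 1: "40GiB", 2: "0GiB", 3: "0GiB"}
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    cache_dir='./workspace/',
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map="auto",
    max_memory=max_memory,
    offload_folder="offload",
)

The other option I can think of is launching the script with CUDA_VISIBLE_DEVICES=0,1 so only those two devices are visible at all, but I'd prefer to control it through device_map/max_memory if that's the right way.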

Thanks for your help!