ValueError: weight is on the meta device when using AutoModelForSequenceClassification

Trying to load Llama for classification, I get a ‘weight is on the meta device’ error.
Code:

import torch
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained('decapoda-research/llama-7b-hf',
                                                           torch_dtype=torch.float16,
                                                           device_map="auto",
                                                           num_labels=3)

I get the error:

  model = AutoModelForSequenceClassification.from_pretrained(
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/transformers/models/auto/auto_factory.py", line 471, in from_pretrained
    return model_class.from_pretrained(
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/transformers/modeling_utils.py", line 2846, in from_pretrained
    dispatch_model(model, device_map=device_map, offload_dir=offload_folder, offload_index=offload_index)
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/accelerate/big_modeling.py", line 396, in dispatch_model
    attach_align_device_hook_on_blocks(
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/accelerate/hooks.py", line 537, in attach_align_device_hook_on_blocks
    attach_align_device_hook_on_blocks(
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/accelerate/hooks.py", line 507, in attach_align_device_hook_on_blocks
    add_hook_to_module(module, hook)
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/accelerate/hooks.py", line 155, in add_hook_to_module
    module = hook.init_hook(module)
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/accelerate/hooks.py", line 253, in init_hook
    set_module_tensor_to_device(module, name, self.execution_device)
  File "/home/lab/user/anaconda3/envs/project/lib/python3.9/site-packages/accelerate/utils/modeling.py", line 281, in set_module_tensor_to_device
    raise ValueError(f"{tensor_name} is on the meta device, we need a `value` to put in on {device}.")
ValueError: weight is on the meta device, we need a `value` to put in on 6.

Importantly, loading the model with AutoModelForCausalLM instead of AutoModelForSequenceClassification works.
It seems as if AutoModelForSequenceClassification creates the newly initialized classification head on the meta device, and then crashes when trying to move it to the GPU.
I tried loading everything directly into memory, without the meta device, but couldn't.
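One way to probe this, assuming enough CPU RAM for the fp16 weights, is to load without device_map (the path the workaround below relies on) and list any parameters left on the meta device; with device_map="auto" this check never runs because dispatch fails first. The head of LlamaForSequenceClassification is a Linear module named score:

import torch
from transformers import AutoModelForSequenceClassification

# Load fully on CPU (no device_map), then report any parameter without real storage.
model = AutoModelForSequenceClassification.from_pretrained(
    'decapoda-research/llama-7b-hf', torch_dtype=torch.float16, num_labels=3)
meta_params = [name for name, p in model.named_parameters() if p.device.type == 'meta']
print(meta_params or 'no parameters on the meta device')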

A workaround I came up with is loading the classification model on the CPU and saving it once with accelerator.save_model. Afterwards the model can be loaded with load_checkpoint_and_dispatch.
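A minimal sketch of that workaround (the output folder name and the no_split_module_classes value are my assumptions, and from_config with torch_dtype needs a reasonably recent transformers):

import torch
from accelerate import Accelerator, init_empty_weights, load_checkpoint_and_dispatch
from transformers import AutoConfig, AutoModelForSequenceClassification

# Step 1: load fully on CPU (no device_map, so nothing is left on meta)
# and save the checkpoint once.
accelerator = Accelerator()
model = AutoModelForSequenceClassification.from_pretrained(
    'decapoda-research/llama-7b-hf', torch_dtype=torch.float16, num_labels=3)
accelerator.save_model(model, 'llama-7b-seqcls')  # folder name is arbitrary

# Step 2: rebuild the model with empty (meta) weights and dispatch the saved
# checkpoint across the available GPUs.
config = AutoConfig.from_pretrained('decapoda-research/llama-7b-hf', num_labels=3)
with init_empty_weights():
    model = AutoModelForSequenceClassification.from_config(config, torch_dtype=torch.float16)
model = load_checkpoint_and_dispatch(model, 'llama-7b-seqcls', device_map='auto',
                                     no_split_module_classes=['LlamaDecoderLayer'])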

Hi @Glick, could you try this model instead: meta-llama/Llama-2-7b-hf?
I've tested it on the latest transformers and I am unable to reproduce the issue.

from transformers import AutoModelForSequenceClassification
import torch
model = AutoModelForSequenceClassification.from_pretrained('meta-llama/Llama-2-7b-hf', 
                                                           torch_dtype=torch.float16, 
                                                           device_map='auto', 
                                                           num_labels=3)
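
If it loads, a quick sanity check (the classification head of LlamaForSequenceClassification is the Linear named score) is that the head ends up on a real device:

print(model.score.weight.device)  # expect a cuda device, not 'meta'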