ValueError: model.embed_tokens.weight doesn't have any device set

Hi, I am getting this error and I don't know what it means. Can someone explain?
The following code triggers the error:
device_map = {
    "transformer.word_embeddings": 0,
    "transformer.word_embeddings_layernorm": 0,
    "lm_head": "cpu",
    "transformer.h": 0,
    "transformer.ln_f": 0,
}

quantization_config = BitsAndBytesConfig(llm_int8_enable_fp32_cpu_offload=True)

model = LlamaForCausalLM.from_pretrained(
    base_model,
    # load_in_8bit=True,
    torch_dtype=torch.float16,
    device_map=device_map,
    quantization_config=quantization_config,
)


I am also facing the same error. Please help with a fix.

Same here. Any news?

While using a device_map, add "model.embed_tokens". In my case I have set everything to "cpu" since I don't have any GPUs; you can set yours as required.

You may also need to add "model.layers" and "model.norm", e.g.:

device_map = {
    "transformer.word_embeddings": "cpu",
    "transformer.word_embeddings_layernorm": "cpu",
    "lm_head": "cpu",
    "transformer.h": "cpu",
    "transformer.ln_f": "cpu",
    "model.embed_tokens": "cpu",
    "model.layers": "cpu",
    "model.norm": "cpu",
}
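
If you are not sure which names your particular model expects in the device_map, you can inspect them without loading any weights. The following is only a sketch and assumes accelerate is installed; base_model is the checkpoint name from the first post:

    from accelerate import init_empty_weights
    from transformers import AutoConfig, AutoModelForCausalLM

    # Build the model structure with empty (meta) weights so nothing large
    # is allocated, then list the prefixes a manual device_map has to cover.
    config = AutoConfig.from_pretrained(base_model)
    with init_empty_weights():
        empty_model = AutoModelForCausalLM.from_config(config)

    # For LlamaForCausalLM this prints names like "model.embed_tokens",
    # "model.layers", "model.norm" and "lm_head.weight".
    print(sorted({".".join(name.split(".")[:2])
                  for name, _ in empty_model.named_parameters()}))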

We have encountered the same issue with Llama-2-70B.

Does anyone know why this error occurs with the device_map, or perhaps have a clear solution?
If anyone has detailed documentation regarding llama2’s tokenizer, sharing it would be greatly appreciated.

Hi,

A device_map specifies where to place each of the individual parameters of the model. If a model supports it, it’s advised to use device_map="auto", which will automatically determine where to place each of the layers (using the priority of GPUs > CPUs > disk). Read more about it here: Handling big models for inference.
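
For example, a minimal sketch of loading with the automatic map (base_model being the checkpoint from the first post):

    import torch
    from transformers import LlamaForCausalLM

    # Let Accelerate decide the placement of every layer automatically.
    model = LlamaForCausalLM.from_pretrained(
        base_model,
        torch_dtype=torch.float16,
        device_map="auto",
    )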

In your case, since you specify the device placement of the individual parameters yourself, it will complain if you don't include ALL of the model's parameter names. The token embedding matrix of the LlamaForCausalLM model is called model.embed_tokens, as can be seen here. Hence the error is raised: that name is missing from your device_map. I see you've specified transformer.word_embeddings, but that's not the name of the token embedding matrix for this model.
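
If you prefer to keep a manual map, something along these lines should work for LlamaForCausalLM. This is only a sketch: it assumes the module names of the current implementation (model.embed_tokens, model.layers, model.norm, lm_head) and that you actually want 8-bit weights, since the fp32 CPU offload flag only has an effect together with load_in_8bit:

    import torch
    from transformers import LlamaForCausalLM, BitsAndBytesConfig

    # Cover every top-level module of LlamaForCausalLM; lm_head stays on CPU,
    # which is why the fp32 CPU offload flag is needed.
    device_map = {
        "model.embed_tokens": 0,
        "model.layers": 0,
        "model.norm": 0,
        "lm_head": "cpu",
    }

    quantization_config = BitsAndBytesConfig(
        load_in_8bit=True,
        llm_int8_enable_fp32_cpu_offload=True,
    )

    model = LlamaForCausalLM.from_pretrained(
        base_model,
        torch_dtype=torch.float16,
        device_map=device_map,
        quantization_config=quantization_config,
    )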