I ran into a problem loading the Llama-2-13b-hf model with the following code:

```python
from transformers import LlamaForCausalLM

model = LlamaForCausalLM.from_pretrained(
    base_model,
    trust_remote_code=True,
    device_map="cuda:0",
    load_in_8bit=True,
)
```
An error was returned:
```
ImportError: Using `load_in_8bit=True` requires Accelerate: `pip install accelerate`
and the latest version of bitsandbytes: `pip install -i https://test.pypi.org/simple/ bitsandbytes`
or `pip install bitsandbytes`
```
But I am pretty sure both packages are installed: bitsandbytes 0.41.3 and accelerate 0.25.0. Has anyone encountered this issue before and knows how to resolve it? Thank you very much!
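In case it is useful, here is a minimal sanity check I would run to confirm that the interpreter executing the script actually sees both packages (the `is_accelerate_available` / `is_bitsandbytes_available` helpers are my assumption based on recent transformers releases):

```python
import sys

# Print the interpreter path -- if pip installed the packages into a
# different environment, from_pretrained will not see them.
print(sys.executable)

import accelerate
import bitsandbytes

print("accelerate:", accelerate.__version__)
print("bitsandbytes:", bitsandbytes.__version__)

# These are the helpers that recent transformers versions appear to consult
# before raising the ImportError above (helper names assumed, not verified
# against my exact transformers version).
from transformers.utils import is_accelerate_available, is_bitsandbytes_available

print("accelerate available:", is_accelerate_available())
print("bitsandbytes available:", is_bitsandbytes_available())
```

If either helper prints `False`, or the imports fail, that would suggest the packages are installed in a different environment than the one running the loading code.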