Error loading Llama model

XIAOXIAOFROZEN · December 27, 2023, 5:08pm

I got a problem loading the Llama-2-13b-hf model with the following code,

LlamaForCausalLM.from_pretrained(base_model, trust_remote_code=True, device_map = “cuda:0”, load_in_8bit = True),

A error returned as
ImportError: Using load_in_8bit=True requires Accelerate: pip install accelerate and the latest version of bitsandbytes pip install -i https://test.pypi.org/simple/ bitsandbytes or pip install bitsandbytes`.

But I am pretty sure the two package are installed with version bitsandbytes-0.41.3, accelerate 0.25.0. Anyone has encountered the issue before and know how to resolve it, thank you very much!

nielsr · December 27, 2023, 6:38pm

Hi,

Are you running in Google Colab? If yes, restarting the runtime may help.

XIAOXIAOFROZEN · December 27, 2023, 7:21pm

Hi, nielsr, I am running it in AWS sagemaker, I found a solution in forum by downgrading transformers to 4.30.0, the error disappears but a new error emerge: SafetensorError: Error while deserializing header: HeaderTooLarge, wonder if you have any clue on that, thank you!

nielsr · December 27, 2023, 7:35pm

It should also work with the latest Transformers version. Could you try creating a new environment?

XIAOXIAOFROZEN · December 27, 2023, 8:14pm

Yes, latest version of transformers also pass that error, thank you very much!

akshat-kumar-akight · March 9, 2024, 8:14am

Hi, were you able to resolve the error?

Topic		Replies	Views
ImportError using AutoModelForCasualLM.from_pretrained Beginners	0	500	April 30, 2024
Llama2-70b SafetensorError: Error while deserializing header: HeaderTooLarge 🤗Transformers	0	1171	December 9, 2023
TypeError: LlamaForCausalLM.__init__() got an unexpected keyword argument 'load_in_4bit' 🤗Transformers	7	20328	October 7, 2023
Issues with loading models in 8bit in Colab Beginners	0	329	December 13, 2023
Could not load model meta-llama/Llama-2-7b-chat-hf with any of the following classes 🤗Transformers	22	49769	December 19, 2024

Error loading Llama model

Related topics