Can't load fine-tuned Llama 2 7B

I fine-tuned a Llama 2 7B model and uploaded it to Hugging Face, but now when I load it in Google Colab I run out of system RAM. (fine-tuned model: Stoemb/llama-2-7b-html2text)

I loaded the model as follows:

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model_name = "Stoemb/llama-2-7b-html2text"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    trust_remote_code=True,
)
model.config.use_cache = False

I'm still learning myself, but I have been playing with a Llama 2 7B model in free Colab and I've found I need to set device_map to "auto" as well as loading in 4-bit or 8-bit, as in the sketch below.
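
For reference, the full loading snippet I use looks roughly like this (just a sketch, reusing the config from the question; it assumes accelerate and bitsandbytes are installed in the Colab runtime):

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

model_name = "Stoemb/llama-2-7b-html2text"

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# device_map="auto" lets accelerate place the quantized weights on the GPU
# (and offload to CPU if needed) instead of materializing the whole model
# in system RAM first.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    device_map="auto",
    trust_remote_code=True,
)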

Don't know if this is still an issue for you, but I had the same problem the first time I fine-tuned Llama 2 and tried to reload it.
The problem is that the default shard size when pushing to the Hub is 10 GB, which is too much for the free Colab T4 runtime to load.

You can read more here:

To solve this, specify a smaller shard size when pushing, as in the example below.

!huggingface-cli login
model.push_to_hub(your_model_name, max_shard_size='2GB')  # split the checkpoint into 2 GB shards
tokenizer.push_to_hub(your_model_name)

This solved the problem I had, at least, and it sounds very similar to yours.
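
If the checkpoint is already on the Hub with 10 GB shards, one way to re-shard it is to reload it once somewhere with enough CPU RAM and push it back with a smaller max_shard_size. A rough sketch (the repo name is the one from the question; the reload step itself needs a machine that can hold the full model):

from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "Stoemb/llama-2-7b-html2text"

# Reload the full checkpoint once (this is the step that needs plenty of RAM),
# then push it back in 2 GB shards so a free Colab runtime can load it afterwards.
model = AutoModelForCausalLM.from_pretrained(repo_id)
tokenizer = AutoTokenizer.from_pretrained(repo_id)

model.push_to_hub(repo_id, max_shard_size="2GB")
tokenizer.push_to_hub(repo_id)

As far as I know, pushing again doesn't automatically remove the old shard files, so it's worth checking the repo afterwards.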