notebook link.
RuntimeError: unable to mmap 9976576152 bytes from file </home/devuser/.cache/huggingface/hub/models--meta-llama--Llama-2-7b-chat-hf/snapshots/c1b0db933684edbfe29a06fa47eb19cc48025e93/model-00001-of-00002.safetensors>: Cannot allocate memory (12)
Were you able to solve it? I am facing the same issue.
Yep, I performed the adapter merge in a separate script, and then I loaded the merged model with `low_cpu_mem_usage=True`.
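For reference, a minimal sketch of that two-step approach (the adapter and output directories are placeholders, not from the thread):

```python
BASE_MODEL = "meta-llama/Llama-2-7b-chat-hf"
ADAPTER_DIR = "./my-lora-adapter"    # hypothetical PEFT adapter directory
MERGED_DIR = "./llama-2-7b-merged"   # hypothetical output directory

def merge_adapter_and_save():
    """Step 1 (run as a separate script): merge the LoRA adapter into the base model."""
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM

    base = AutoModelForCausalLM.from_pretrained(
        BASE_MODEL,
        torch_dtype=torch.float16,
        low_cpu_mem_usage=True,  # stream weights in instead of materializing twice
    )
    merged = PeftModel.from_pretrained(base, ADAPTER_DIR).merge_and_unload()
    merged.save_pretrained(MERGED_DIR)

def load_merged_model():
    """Step 2: load the merged weights with low_cpu_mem_usage to avoid the mmap failure."""
    import torch
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        MERGED_DIR,
        torch_dtype=torch.float16,
        low_cpu_mem_usage=True,
    )
```

Doing the merge in its own process matters because it keeps the full-precision base model and the adapter from having to coexist with your inference setup in one Python process.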
Thanks!
But wouldn’t this keep the entire model on the CPU, even if you have a GPU?
I don’t think so. `low_cpu_mem_usage` only changes how the weights are loaded; where they end up is controlled by the `device_map` argument.
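A hedged sketch of that combination (requires `accelerate`; the model id is the one from the traceback above):

```python
def load_on_gpu(model_id="meta-llama/Llama-2-7b-chat-hf"):
    """Load with low CPU memory usage while placing weights on the GPU.

    device_map="auto" lets accelerate fill GPU memory first and only
    offloads the remaining layers to CPU (or disk) if VRAM runs out,
    so the model is not forced onto the CPU.
    """
    import torch
    from transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,
        device_map="auto",       # or an explicit map, e.g. {"": 0} for GPU 0
        low_cpu_mem_usage=True,  # implied when device_map is set, shown for clarity
    )
```

You can also pass an explicit dict mapping module names to devices if you want to pin specific layers to CPU yourself.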