Running out of System RAM while loading BLIP2 on Colab?

Kartikeya · November 15, 2023, 7:59am

I’m trying to load the BLIP2 model on Google Colab using the code below.

!pip install --quiet bitsandbytes
!pip install --quiet --upgrade transformers # Install latest version of transformers
!pip install --quiet --upgrade accelerate
!pip install --quiet sentencepiece

model_name = "blip2-opt-2.7b"

from transformers import AutoModelForSeq2SeqLM, AutoProcessor
from transformers import BlipProcessor, Blip2ForConditionalGeneration
from accelerate import Accelerator
import torch
accelerator = Accelerator()

model_id=f"Salesforce/{model_name}"
processor = AutoProcessor.from_pretrained(model_id, load_in_8bit=True)
model = Blip2ForConditionalGeneration.from_pretrained(model_id, load_in_8bit=True, device_map="auto", \
                                                      offload_state_dict=True, \
                                                      offload_folder="offload")
model = accelerator.prepare(model)

Even after using accelerate and setting device_map to “auto,” I’m unable to load the model. Moreover, while loading, it’s not utilizing the GPU RAM (on Colab, which is 15 GB) and exhausting the entire system RAM. Is there something that I’m missing?

Topic		Replies	Views
Cuda out of memory on Google Colab when running Blip2 Models	0	341	September 21, 2023
General question about large model loading 🤗Accelerate	2	917	November 28, 2024
Colab RAM Limit Exceeded: Unable to Run 3B Model Even with Quantization Beginners	0	1058	August 8, 2023
Colab's session crashed after using all available RAM when loading falcon-7B Beginners	2	1204	October 26, 2023
Llama-2 on colab Beginners	3	11379	November 28, 2023

Running out of System RAM while loading BLIP2 on Colab?

Related topics