Hello All,
I am trying to load facebook/opt-6.7b (Meta's OPT 6.7B) on Kaggle with dual T4 GPUs. The code is simple, and it runs fine in Colab, but I can't make it work on Kaggle. What am I missing?
Here is the code:
!pip install bitsandbytes datasets accelerate loralib
!pip install transformers peft
import os
# os.environ["CUDA_VISIBLE_DEVICES"]="0"
import torch
import torch.nn as nn
import bitsandbytes as bnb
from transformers import AutoTokenizer, AutoConfig, AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-6.7b",
    load_in_8bit=True,
    device_map='auto',
)
tokenizer = AutoTokenizer.from_pretrained("facebook/opt-6.7b")
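Since the same code works in Colab, I suspect Kaggle's preinstalled library versions are different. For reference, here is a quick check I added of what actually got resolved (my assumption is that a transformers/accelerate version mismatch is the culprit):
import accelerate
import transformers
# print the resolved versions; Kaggle's preinstalled packages may lag behind Colab's
print("accelerate:", accelerate.__version__)
print("transformers:", transformers.__version__)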
And here is the stack trace.
---------------------------------------------------------------------------
TypeError Traceback (most recent call last)
/tmp/ipykernel_23/3843057406.py in <module>
9 "facebook/opt-6.7b",
10 load_in_8bit=True,
---> 11 device_map='auto',
12 )
13
/opt/conda/lib/python3.7/site-packages/transformers/models/auto/auto_factory.py in from_pretrained(cls, pretrained_model_name_or_path, *model_args, **kwargs)
463 model_class = _get_model_class(config, cls._model_mapping)
464 return model_class.from_pretrained(
--> 465 pretrained_model_name_or_path, *model_args, config=config, **hub_kwargs, **kwargs
466 )
467 raise ValueError(
/opt/conda/lib/python3.7/site-packages/transformers/modeling_utils.py in from_pretrained(cls, pretrained_model_name_or_path, *model_args, **kwargs)
2527 # Dispatch model with hooks on all devices if necessary
2528 if device_map is not None:
-> 2529 dispatch_model(model, device_map=device_map, offload_dir=offload_folder, offload_index=offload_index)
2530
2531 if output_loading_info:
TypeError: dispatch_model() got an unexpected keyword argument 'offload_index'
Any idea what can be done? I am a bit clueless here, and any help is much appreciated.
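For what it's worth, the TypeError itself suggests the installed accelerate is older than what this transformers build expects: transformers passes an offload_index argument that this accelerate's dispatch_model() does not accept. One thing I plan to try, assuming that mismatch is the cause, is upgrading both packages to matching recent releases:
!pip install -U accelerate transformers
# then restart the notebook kernel so the upgraded packages are actually imported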