Error while trying to load the "deepseek-ai/DeepSeek-V3" model

I am using the following code to run DeepSeek-V3.

My code:

!pip install torch==2.4.1
!pip install torchvision==0.19.1
!pip install triton==3.0.0
# !pip install transformers==4.46.3
!pip install transformers==4.36.2
!pip install bitsandbytes==0.41.2
!pip install safetensors==0.4.5
!pip install "accelerate>=0.26.0"
!git clone https://github.com/deepseek-ai/DeepSeek-V3.git
%cd DeepSeek-V3/inference
!pip install -r requirements.txt 
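
Before loading anything, it can help to confirm that the pinned packages actually resolved (a quick sanity check in a fresh kernel after the installs; nothing DeepSeek-specific):

import torch, transformers, bitsandbytes, accelerate
print(torch.__version__, transformers.__version__,
      bitsandbytes.__version__, accelerate.__version__)
print("CUDA available:", torch.cuda.is_available())
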
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,                  # quantize weights to 4-bit on load
    bnb_4bit_compute_dtype="float16",   # run compute in fp16
    bnb_4bit_use_double_quant=True      # nested quantization to save extra memory
)

model = AutoModelForCausalLM.from_pretrained(
    "deepseek-ai/DeepSeek-V3",
    quantization_config=quantization_config,
    device_map="auto",        # spread layers across available GPUs/CPU
    trust_remote_code=True    # model ships custom modeling code on the Hub
)

tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-V3")
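
For reference, once the model loads this is how I intend to run generation (a minimal sketch; the prompt and generation settings are only placeholders):

inputs = tokenizer("Hello, how are you?", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))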

The error I am getting:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
Cell In[9], line 9
      1 from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
      3 quantization_config = BitsAndBytesConfig(
      4     load_in_4bit=True,  
      5     bnb_4bit_compute_dtype="float16",
      6     bnb_4bit_use_double_quant=True
      7 )
----> 9 model = AutoModelForCausalLM.from_pretrained(
     10     "deepseek-ai/DeepSeek-V3",
     11     quantization_config=quantization_config,
     12     device_map="auto",
     13     trust_remote_code=True
     14 )
     16 tokenizer = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-V3")

File /home/zeus/miniconda3/envs/cloudspace/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py:559, in _BaseAutoModelClass.from_pretrained(cls, pretrained_model_name_or_path, *model_args, **kwargs)
    557     cls.register(config.__class__, model_class, exist_ok=True)
    558     model_class = add_generation_mixin_to_remote_model(model_class)
--> 559     return model_class.from_pretrained(
    560         pretrained_model_name_or_path, *model_args, config=config, **hub_kwargs, **kwargs
    561     )
    562 elif type(config) in cls._model_mapping.keys():
    563     model_class = _get_model_class(config, cls._model_mapping)
...
    100     )
    102 target_cls = AUTO_QUANTIZATION_CONFIG_MAPPING[quant_method]
    103 return target_cls.from_dict(quantization_config_dict)

ValueError: Unknown quantization type, got fp8 - supported types are: ['awq', 'bitsandbytes_4bit', 'bitsandbytes_8bit', 'gptq', 'aqlm', 'quanto', 'eetq', 'hqq', 'compressed-tensors', 'fbgemm_fp8', 'torchao', 'bitnet']
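
From what I can tell, the checkpoint's config.json declares fp8 as its quantization method, which my installed transformers version does not recognize. This can be confirmed without downloading the weights (a minimal sketch using huggingface_hub, purely illustrative):

from huggingface_hub import hf_hub_download
import json

# Fetch only the small config file, not the very large weight shards.
config_path = hf_hub_download("deepseek-ai/DeepSeek-V3", "config.json")
with open(config_path) as f:
    config = json.load(f)

# For DeepSeek-V3 this prints "fp8", the type rejected in the traceback above.
print(config.get("quantization_config", {}).get("quant_method"))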

Could you please help me resolve this issue?

Useful References:

  1. deepseek-ai/DeepSeek-V3 · CUDA out of memory error during fp8 to bf16 model conversion + fix
  2. GitHub - deepseek-ai/DeepSeek-V3

Loading this model with its native fp8 quantization is not supported in released versions of transformers due to various problems, which is why from_pretrained rejects the fp8 quant_method declared in the model's config. It seems support will be added soon; it may already work with the GitHub (development) version:

!pip install git+https://github.com/huggingface/transformers
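
After installing the development build, you can check whether fp8 is now a recognized quantization method before retrying the load (a minimal sketch; AUTO_QUANTIZATION_CONFIG_MAPPING is the mapping shown in your traceback, and the import path below assumes the same module layout):

import transformers
print(transformers.__version__)

# If "fp8" appears in this mapping, from_pretrained should accept the
# quantization config declared in the model's config.json.
from transformers.quantizers.auto import AUTO_QUANTIZATION_CONFIG_MAPPING
print(sorted(AUTO_QUANTIZATION_CONFIG_MAPPING.keys()))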