Qwen/Qwen1.5-7B-Chat: RuntimeError: The serialized model is larger than the 2GiB limit (ORTModelForCausalLM)

from optimum.onnxruntime import ORTModelForCausalLM

base_model_name = "Qwen/Qwen1.5-7B-Chat"

ort_model = ORTModelForCausalLM.from_pretrained(
    base_model_name,
    use_io_binding=True,
    export=True,
)

RuntimeError: The serialized model is larger than the 2GiB limit imposed by the protobuf library. Therefore the output file must be a file path, so that the ONNX external data can be written to the same directory. Please specify the output file name.
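For context, the error means the exported graph plus weights exceeds protobuf's 2GiB serialization limit, so the weights have to be written as ONNX external data files next to model.onnx. A minimal sketch of exporting to a directory with optimum's programmatic exporter (the qwen_onnx output directory name is my own choice, not from the error message):

from optimum.exporters.onnx import main_export

# Export to a directory so weights over the 2GiB protobuf limit can be
# stored as external data files alongside model.onnx.
main_export(
    model_name_or_path="Qwen/Qwen1.5-7B-Chat",
    output="qwen_onnx",                  # hypothetical output directory
    task="text-generation-with-past",    # decoder export with KV cache
)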

I provided the file_name argument:

ort_model = ORTModelForCausalLM.from_pretrained(
    base_model_name,
    use_io_binding=True,
    export=True,
    file_name='/qwen_exp/model.onnx',
)

That raises a different error:

Traceback (most recent call last):
  File "/home/sr/test_qwen_ort.py", line 11, in <module>
    ort_model = ORTModelForCausalLM.from_pretrained(
  File "/home/sr//conda_env/anaconda3/envs/vaiq_onnx/lib/python3.9/site-packages/optimum/onnxruntime/modeling_ort.py", line 737, in from_pretrained
    return super().from_pretrained(
  File "/home/sr//conda_env/anaconda3/envs/vaiq_onnx/lib/python3.9/site-packages/optimum/modeling_base.py", line 438, in from_pretrained
    return from_pretrained_method(
TypeError: _from_transformers() got an unexpected keyword argument 'file_name'


@regisss I tried the steps as in


Perhaps this is a bug?
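A possible workaround, not a confirmed fix (the TypeError suggests file_name simply isn't forwarded to _from_transformers during export): export to a directory first, as in the main_export sketch above, and then load the already-exported model, so no file_name is needed at all. A minimal sketch, assuming the hypothetical qwen_onnx directory from the earlier step:

from optimum.onnxruntime import ORTModelForCausalLM

# Load the exported model from the directory; export=True and file_name are
# unnecessary because model.onnx and its external data files already exist there.
ort_model = ORTModelForCausalLM.from_pretrained(
    "qwen_onnx",
    use_io_binding=True,
)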