Hello
I used optimum 1.18 to create a quantized ONNX model:
optimum-cli onnxruntime quantize --avx2 --onnx_model model.onnx --output quantized_model.onnx
Then I tried to load quantized_model.onnx through the C++ API of the onnxruntime 1.17.1 shared library, but session creation was rejected with "Protobuf parsing failed".
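For reference, the loading code is essentially the minimal session setup below (a simplified sketch; the real code also configures session options and providers, and the error handling here is just for illustration):

```cpp
#include <onnxruntime_cxx_api.h>
#include <iostream>

int main() {
    // Standard environment and default session options.
    Ort::Env env(ORT_LOGGING_LEVEL_WARNING, "quantized-test");
    Ort::SessionOptions options;

    try {
        // This constructor succeeds for model.onnx but throws
        // "Protobuf parsing failed" for quantized_model.onnx.
        Ort::Session session(env, "quantized_model.onnx", options);
        std::cout << "Model loaded, input count: "
                  << session.GetInputCount() << std::endl;
    } catch (const Ort::Exception& e) {
        std::cerr << "Failed to create session: " << e.what() << std::endl;
        return 1;
    }
    return 0;
}
```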
Notes:
1) The same onnxruntime binary is able to open and load the non-quantized model.onnx.
2) The quantized model can be opened with Netron.
Does anyone have an idea what I am missing?
Thank you