First, I converted my model to ONNX with:

```
!optimum-cli export onnx -m best_M/ --task vision2seq-lm best_M_onnx/ --atol 1e-3
```
The exported ONNX model worked perfectly when I tested it. I then tried to quantize it with:

```
!optimum-cli onnxruntime quantize --onnx_model best_M_onnx/ --avx512 -o best_M_quant/
```
This successfully quantizes the model (the file size shrinks), but when I test the quantized model I get this error:

```
Traceback (most recent call last):
File "onnx_quant.py", line 213, in <module>
test_ort()
File "onnx_quant.py", line 159, in test_ort
model = ORTModelForVision2Seq()
File "onnx_quant.py", line 110, in __init__
self.encoder = ORTEncoder()
File "onnx_quant.py", line 38, in __init__
self.session = onnxrt.InferenceSession(onnx_encoder, providers=["CPUExecutionProvider"]
File "/home/sai/anaconda3/envs/onnx/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 419, in __init__
self._create_inference_session(providers, provider_options, disabled_optimizers)
File "/home/sai/anaconda3/envs/onnx/lib/python3.8/site-packages/onnxruntime/capi/onnxruntime_inference_collection.py", line 463, in _create_inference_session
sess.initialize_session(providers, provider_options, disabled_optimizers)
onnxruntime.capi.onnxruntime_pybind11_state.NotImplemented: [ONNXRuntimeError] : 9 : NOT_IMPLEMENTED : Could not find an implementation for ConvInteger(10) node with name '/embeddings/patch_embeddings/projection/Conv_quant'
```
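From what I can tell, the CPU execution provider apparently lacks a kernel for this particular `ConvInteger` variant, so quantizing the patch-embedding `Conv` layer may be what breaks the model. One idea I have not verified yet: re-quantize from the Python API and restrict quantization to operators that do have integer CPU kernels, leaving `Conv` out. This is only a sketch; the `operators_to_quantize` parameter name and the per-file export names are my assumptions, not something I have confirmed:

```python
# Operators to quantize: ops with integer CPU kernels in onnxruntime.
# "Conv" is deliberately left out so no ConvInteger nodes are produced.
OPS_TO_QUANTIZE = ["MatMul", "Add", "Gather"]

def requantize_without_conv(model_dir="best_M_onnx/", out_dir="best_M_quant/"):
    # Imports deferred so the sketch is readable without optimum installed.
    from optimum.onnxruntime import ORTQuantizer
    from optimum.onnxruntime.configuration import AutoQuantizationConfig

    # Dynamic avx512 config, mirroring the --avx512 CLI flag.
    # operators_to_quantize is my assumption about the parameter name.
    qconfig = AutoQuantizationConfig.avx512(
        is_static=False,
        per_channel=False,
        operators_to_quantize=OPS_TO_QUANTIZE,
    )

    # A vision2seq export produces separate encoder/decoder ONNX files;
    # these file names are my guess at what optimum-cli wrote.
    for onnx_file in ["encoder_model.onnx", "decoder_model.onnx"]:
        quantizer = ORTQuantizer.from_pretrained(model_dir, file_name=onnx_file)
        quantizer.quantize(save_dir=out_dir, quantization_config=qconfig)
```

Would something like this keep the `Conv` in float while still shrinking the rest of the model, or is there a better way?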
Please help me, and thanks in advance!