RuntimeError when loading a custom Whisper model exported to ONNX

I’ve successfully exported a custom Whisper model to ONNX using the example code from the Custom Export of Transformers Models documentation. The export completed without errors, and the model was saved to the custom_whisper_onnx directory.
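
For context, here is a condensed sketch of the export step, closely following the documentation example (the base checkpoint openai/whisper-tiny.en and the simplified config setup are assumptions; my actual script only differs in the custom OnnxConfig tweaks described in the docs):

from transformers import AutoConfig
from optimum.exporters.onnx import main_export
from optimum.exporters.onnx.model_configs import WhisperOnnxConfig

model_id = "openai/whisper-tiny.en"  # assumed base checkpoint
config = AutoConfig.from_pretrained(model_id)

# One ONNX config per sub-model, as in the documentation example
whisper_onnx_config = WhisperOnnxConfig(config=config, task="automatic-speech-recognition")
custom_onnx_configs = {
    "encoder_model": whisper_onnx_config.with_behavior("encoder"),
    "decoder_model": whisper_onnx_config.with_behavior("decoder", use_past=False),
    "decoder_with_past_model": whisper_onnx_config.with_behavior("decoder", use_past=True),
}

main_export(
    model_id,
    output="custom_whisper_onnx",   # the directory I later try to load from
    no_post_process=True,           # as in the docs example; skips post-processing such as decoder merging
    custom_onnx_configs=custom_onnx_configs,
)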

However, when I try to load the exported model with the following code:

from transformers import AutoProcessor
from optimum.onnxruntime import ORTModelForSpeechSeq2Seq

processor = AutoProcessor.from_pretrained("custom_whisper_onnx")
model = ORTModelForSpeechSeq2Seq.from_pretrained("custom_whisper_onnx")

I encounter a RuntimeError:

RuntimeError: Could not find the past key values in the provided model.

The traceback points to the initialization of the ORTModelForSpeechSeq2Seq class.

By contrast, when I load a model pre-exported by Hugging Face, such as optimum/whisper-tiny.en, it works without any issues:

processor = AutoProcessor.from_pretrained("optimum/whisper-tiny.en")
model = ORTModelForSpeechSeq2Seq.from_pretrained("optimum/whisper-tiny.en")
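
In case the file layout is relevant, here is a small sketch for comparing what my export produced against the files in the working repo (list_repo_files is from huggingface_hub; I haven’t yet confirmed whether the set of .onnx files actually differs):

import os
from huggingface_hub import list_repo_files

# Files produced by my custom export (local directory)
print(sorted(os.listdir("custom_whisper_onnx")))

# ONNX files in the pre-exported repo that loads fine
print(sorted(f for f in list_repo_files("optimum/whisper-tiny.en") if f.endswith(".onnx")))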