Different Inference Results of Whisper on x86 and ARM CPU

PC-Chen · May 6, 2023, 12:07pm

Hello everyone,

I am working on the Whisper model inference and getting different results on two different CPU architectures. On an Intel CPU, I am able to get normal results. However, on an ARM CPU, the model is unable to predict normal tokens, resulting in empty text. After checking, I found that the GELU function output is different on both CPUs.

I am using the pre-trained Whisper-tiny.en model directly, and the Python and PyTorch versions are the same on both CPUs, with all relevant libraries installed.

The ARM CPU is Cortex-A53 on Xilinx KV260. The x86 CPU is Intel Core i7.

I would like to ask what could be the possible cause of this issue and how to resolve it? Thank you.

move47 · July 8, 2023, 12:54am

Hi,
I am facing the same issue. Did you get any way to make it work consistently across the CPUs? Thank you.

Topic		Replies	Views
Different inference speed for finetuned Whisper models Beginners	0	397	February 28, 2024
Finetuned whisper model translating instead of transcribing 🤗Transformers	2	734	December 31, 2023
Has Anyone Successfully Fine-Tuned Whisper for a Local Language for better accuracy Beginners	5	202	May 27, 2025
Different Inference Speed for same size models Models	0	389	August 29, 2021
Confusing Benchmark results Running whisper on 4080 Super vs A10 vs H100 🤗Transformers	0	457	April 22, 2024

Different Inference Results of Whisper on x86 and ARM CPU

Related topics