Convert OpenAI Whisper transformer model to a quantized TFLite model

I was able to convert the Hugging Face Whisper ONNX model to a TFLite (int8) model; however, I am not sure how to run inference on it.
Could anyone please review this and let me know if I am missing anything in the ONNX-to-TFLite conversion?

Ask ChatGPT :joy:.

I think you should take a look at the original Whisper code that they used for inference and try to replicate it for your model.
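
To give a rough idea of what replicating it could look like: as far as I understand, the original inference feeds the audio features through the encoder once and then runs the decoder one token at a time until an end-of-text token comes out. Below is a minimal sketch of that greedy loop in plain Python. Everything named here is an assumption for illustration: `step_fn` is a hypothetical wrapper you would write around your own TFLite interpreter call (it takes the tokens so far and returns next-token logits), `toy_step` is a stand-in so the loop can run without a model, and the token ids are placeholders, not Whisper's real special-token ids.

```python
from typing import Callable, List, Sequence


def argmax(values: Sequence[float]) -> int:
    """Return the index of the largest value (first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def greedy_decode(
    step_fn: Callable[[List[int]], Sequence[float]],
    prompt: List[int],
    eot_id: int,
    max_new_tokens: int = 128,  # arbitrary cap chosen for this sketch
) -> List[int]:
    """Greedy autoregressive decoding.

    step_fn is a HYPOTHETICAL wrapper around the converted model: given all
    tokens so far, it returns the logits for the next token. With a real
    TFLite model it would set the input tensors, invoke the interpreter,
    and read back the last position of the output.
    """
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        logits = step_fn(tokens)
        next_id = argmax(logits)
        if next_id == eot_id:  # stop once the model emits end-of-text
            break
        tokens.append(next_id)
    # Return only the newly generated tokens, without the prompt.
    return tokens[len(prompt):]


def toy_step(tokens: List[int]) -> List[float]:
    """Toy stand-in for the model (NOT Whisper).

    Uses a 5-token vocabulary where id 4 plays the role of end-of-text.
    It predicts the token whose id equals the current sequence length,
    then end-of-text once the sequence holds 4 tokens.
    """
    vocab_size, eot = 5, 4
    logits = [0.0] * vocab_size
    logits[min(len(tokens), eot)] = 1.0
    return logits


if __name__ == "__main__":
    print(greedy_decode(toy_step, prompt=[0], eot_id=4))
```

With your model, the prompt would be the start-of-transcript and task tokens from the tokenizer you exported with, and the generated ids would go back through that same tokenizer to get text. Since the model is int8, also check the quantization parameters reported for its input and output tensors, because the inputs may need to be quantized before invoking and the logits dequantized afterwards.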