Convert ASR to ONNX

ierezell · February 12, 2021, 9:00pm

Hello,

It’s really nice that speech technology is now so easy to access thanks to the new models (like facebook/wav2vec2-base-960h).

However, there is no built-in way to convert the model to ONNX.
No problem, we can just follow this nice PyTorch tutorial, which works like a charm!

However, when using the exported model from, let’s say, Node.js, there is no way to get the tokenizer back, because only the Python library defines Wav2Vec2Tokenizer…

How could we use it in other stacks?

EDIT:
I checked the source code of Wav2Vec2Tokenizer, and it appears to only do padding? So I guess it would be doable, in a small amount of time, to replicate this behaviour in another language? Something like: get the raw audio array, pad it, convert it to an ONNX tensor, run it through the ONNX model, and then decode the output with some logic and the vocab file.
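To make the idea concrete, here is a rough sketch of the two pieces that would have to be re-implemented outside the Python library: the input preparation and the output decoding. It is written in plain Python without numpy so the logic is easy to port. A few assumptions to verify against the real model: as far as I can tell the tokenizer can also normalize each utterance to zero mean and unit variance (not only pad), the CTC blank is the `<pad>` token, and `|` is the word delimiter. The function names and the tiny vocab below are made up for illustration; the real mapping would come from the model's vocab file.

```python
import math
from typing import Dict, List, Sequence


def normalize(samples: Sequence[float], eps: float = 1e-5) -> List[float]:
    """Scale one utterance to zero mean and unit variance (assumed preprocessing)."""
    mean = sum(samples) / len(samples)
    var = sum((s - mean) ** 2 for s in samples) / len(samples)
    scale = math.sqrt(var + eps)
    return [(s - mean) / scale for s in samples]


def pad_batch(batch: Sequence[Sequence[float]], pad_value: float = 0.0) -> List[List[float]]:
    """Right-pad every utterance to the length of the longest one in the batch."""
    longest = max(len(x) for x in batch)
    return [list(x) + [pad_value] * (longest - len(x)) for x in batch]


def ctc_greedy_decode(
    logits: Sequence[Sequence[float]],
    id_to_token: Dict[int, str],
    blank_id: int = 0,
    word_delimiter: str = "|",
) -> str:
    """Greedy CTC decoding: argmax per frame, collapse repeats, drop blanks."""
    best = [max(range(len(frame)), key=frame.__getitem__) for frame in logits]
    tokens: List[str] = []
    previous = None
    for idx in best:
        if idx != previous and idx != blank_id:
            tokens.append(id_to_token[idx])
        previous = idx
    return "".join(tokens).replace(word_delimiter, " ").strip()


# Hypothetical toy vocab, NOT the real one from the model's vocab file.
TOY_VOCAB = {0: "<pad>", 1: "|", 2: "H", 3: "I"}


def one_hot(index: int, size: int = 4) -> List[float]:
    """Fake a logits frame whose argmax is `index` (stand-in for model output)."""
    return [1.0 if i == index else 0.0 for i in range(size)]


fake_logits = [one_hot(i) for i in [2, 2, 0, 3, 3, 1, 2, 0, 3]]
print(ctc_greedy_decode(fake_logits, TOY_VOCAB))  # prints "HI HI"
```

Between `pad_batch` and `ctc_greedy_decode` sits the actual inference call, which would go through an ONNX runtime binding for the target stack (for Node.js, presumably something like the `onnxruntime-node` package); that part is left out here because it depends on how the model was exported.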

Thanks in advance and have a great day.
