I am currently trying to run this model: facebook/wav2vec2-xls-r-2b-22-to-16
The example code using the pipeline gives significantly different results compared to the API hosted on Hugging Face. I recorded the same audio and sent it to the API through the website, and also ran the sample code on Colab; the outputs are quite different.
I also ran it on the patrickvonplaten/librispeech_asr_dummy dataset and checked the audio; the model's output differs from the expected text translation.
I tried the second, step-by-step method as well; it fails with "cannot import name 'SpeechEncoderDecoder' from 'transformers'".
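Looking at the transformers source, I suspect the class is exposed as SpeechEncoderDecoderModel rather than SpeechEncoderDecoder, so this is the import I tried instead (a sketch, assuming a recent transformers release; the checkpoint is several GB, so the actual load is commented out):

```python
# The public class appears to be SpeechEncoderDecoderModel,
# paired with Speech2Text2Processor for this checkpoint.
from transformers import SpeechEncoderDecoderModel, Speech2Text2Processor

MODEL_ID = "facebook/wav2vec2-xls-r-2b-22-to-16"

# Heavy download, so commented out here:
# model = SpeechEncoderDecoderModel.from_pretrained(MODEL_ID)
# processor = Speech2Text2Processor.from_pretrained(MODEL_ID)
```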
Could you check what might be wrong? I can share my Colab notebook if needed.
Thanks for your help in advance.