How to use the wav2vec2-large-TIMIT-IPA2 model?

nguyenhongphat0 · June 4, 2023, 8:27am

I need to use the https://huggingface.co/speech31/wav2vec2-large-TIMIT-IPA2 model to convert speech to the International Phonetic Alphabet (IPA). I don’t know how to pass in the audio file and get the string output:

from transformers import AutoProcessor, AutoModelForCTC

# Load the processor (feature extractor + tokenizer) and the CTC model
processor = AutoProcessor.from_pretrained("speech31/wav2vec2-large-TIMIT-IPA2")
model = AutoModelForCTC.from_pretrained("speech31/wav2vec2-large-TIMIT-IPA2")

audio_file = "./male-voice.wav"
ipa_output = None  # what should I do next?
print(ipa_output)

Thank you for your help!
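
One possible way to fill in the missing step is sketched below. It is not taken from the model card: it assumes the checkpoint behaves like a standard wav2vec2 CTC model that expects 16 kHz mono audio, and it assumes `librosa` is installed for loading and resampling the file (`soundfile` or `torchaudio` would work too). The function name `transcribe_to_ipa` and the constant `SAMPLE_RATE` are made up for this example.

```python
SAMPLE_RATE = 16000  # assumption: the usual wav2vec2 input rate


def transcribe_to_ipa(audio_path, model_id="speech31/wav2vec2-large-TIMIT-IPA2"):
    """Hypothetical helper: return the decoded string for one audio file."""
    # Imported inside the function so that defining it does not
    # require the heavy dependencies to be loaded up front.
    import librosa  # assumption: used here only to read and resample audio
    import torch
    from transformers import AutoModelForCTC, AutoProcessor

    processor = AutoProcessor.from_pretrained(model_id)
    model = AutoModelForCTC.from_pretrained(model_id)
    model.eval()

    # Load the file as a mono float waveform, resampled to SAMPLE_RATE
    speech, _ = librosa.load(audio_path, sr=SAMPLE_RATE, mono=True)

    # Turn the raw waveform into model inputs (PyTorch tensors)
    inputs = processor(speech, sampling_rate=SAMPLE_RATE, return_tensors="pt")

    # Forward pass without gradients; logits have shape (batch, frames, vocab)
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy CTC decoding: most likely token per frame, then let the
    # processor collapse repeats and drop blank tokens
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

With that in place, `ipa_output = transcribe_to_ipa(audio_file)` would replace the placeholder line in the snippet above. Greedy argmax decoding is the simplest choice; if the output looks wrong, the sampling rate of the input file is the first thing worth checking.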
