Create speech to text training dataset using text to speech model

bjubert · February 8, 2023, 3:44pm

Hello,

I want to use Whisper, a model for speech recognition but I have an issue. The problem is the way of people speak in my audio files is very specific so whisper encounters difficulties.

I had the idea to try to fine tune the model for better results but i don’t have enough data to do it. So my question is : is it possible to use a text to speech model to create a dataset to train a speech to text model ?

Thanks in advance.

Topic	Replies	Views
I want train my own model speech recognation localy on my data my voice how to do that I can't find start I need very help 🤗Datasets	371	December 7, 2021
Using inference api on model that returns an audio file Models	389	November 23, 2021
Unable to find Speech2Text model Models	236	March 5, 2021
Using inference api on espnet/kan-bayashi_ljspeech_vits model Beginners	395	November 27, 2021
SpeechBrain EncoderDecoderASR transcribe_file() Runs out of Memory Models	500	April 17, 2022

Create speech to text training dataset using text to speech model

Related topics