Help in finetuning ASR models

Davide85 · January 12, 2023, 6:55pm

Hi all,

Excuse me, I am a newbie. My question is about fine-tuning ASR models (especially facebook/s2t-small-librispeech-asr) on a custom dataset consisting of my own recordings. Where can I find an example of that?

Thanks in advance,

Davide

stevhliu · January 12, 2023, 7:03pm

Hi, there is a task guide for ASR in the docs here

Davide85 · January 13, 2023, 6:05pm

Thank you so much. The guide above uses a Dataset object from the datasets library, whereas in my case I have raw WAV files. How can I convert this data in order to fine-tune the ASR model?
Thank you,
Davide

stevhliu · January 13, 2023, 7:00pm

You can create your own audio dataset with your files to get a Dataset object.

The easiest option is probably the AudioFolder builder. You can point it at a local folder containing your audio files, or create a dataset repo on the Hub and upload the files to it. For a local folder, you can load it like:

from datasets import load_dataset
dataset = load_dataset("audiofolder", data_dir="/path/to/data")

Check out the docs here for more details!
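
For fine-tuning you will also need a transcript for each recording. Here is a minimal sketch of one way to lay that out for AudioFolder: a folder of WAV files plus a metadata.csv with a file_name column and a transcription column. The file names, transcripts, and helper functions below are hypothetical, and the silent WAV files are only placeholders for your real recordings. The 16 kHz sampling rate is an assumption based on the model being trained on LibriSpeech, so check the model card before relying on it.

```python
import csv
import tempfile
import wave
from pathlib import Path


def write_silent_wav(path, seconds=1.0, sampling_rate=16000):
    """Write a mono 16-bit PCM WAV of silence (placeholder for a real recording)."""
    n_frames = int(seconds * sampling_rate)
    with wave.open(str(path), "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sampling_rate)
        wav.writeframes(b"\x00\x00" * n_frames)


def build_audiofolder(data_dir, transcripts):
    """Create placeholder WAV files and a metadata.csv linking each file to its text."""
    data_dir = Path(data_dir)
    data_dir.mkdir(parents=True, exist_ok=True)
    with open(data_dir / "metadata.csv", "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["file_name", "transcription"])
        for file_name, text in transcripts.items():
            write_silent_wav(data_dir / file_name)
            writer.writerow([file_name, text])
    return data_dir


def load_audiofolder(data_dir, sampling_rate=16000):
    """Load the folder with the datasets library and resample audio on access.

    Requires `pip install "datasets[audio]"`; not called in this sketch.
    """
    from datasets import Audio, load_dataset

    dataset = load_dataset("audiofolder", data_dir=str(data_dir))
    return dataset.cast_column("audio", Audio(sampling_rate=sampling_rate))


# Hypothetical file names and transcripts, for illustration only.
example_transcripts = {
    "recording_001.wav": "hello world",
    "recording_002.wav": "this is a test",
}
data_dir = build_audiofolder(
    Path(tempfile.mkdtemp()) / "my_recordings", example_transcripts
)
# dataset = load_audiofolder(data_dir)  # uncomment once datasets is installed
```

With your own data you would skip write_silent_wav, copy your recordings into the folder, and only write the metadata.csv. The loaded dataset should then expose an audio column and a transcription column that the preprocessing step in the ASR task guide can consume.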
