Please, help me

testingemailst · December 16, 2021, 6:30pm

I have voice recordings of the names of the cities in my country I want to build dataset and convert it to speech recognation model for my contry cities
please help me with this
I don’t know if I should go to Mars I can’t find the answer
Please, help me

lhoestq · January 10, 2022, 11:50am

Hi ! You can create a dataset from your audio files with

from datasets import Dataset, Features, Audio

features = features({"audio": Audio()})
dataset = Dataset.from_dict({"audio": list_of_paths_to_my_audio_files}, features=features)

You can also find some documentation about audio dataset processing here: Process audio data — datasets 1.17.0 documentation

Topic		Replies	Views
Create the Moxilla Common Voice Data 🤗Datasets	2	816	November 15, 2022
Create own dataset of train and test in separate folders 🤗Datasets	1	773	January 26, 2023
I want train my own model speech recognation localy on my data my voice how to do that I can't find start I need very help 🤗Datasets	0	355	December 7, 2021
Audio dataset without uploading the data to the hub 🤗Datasets	6	1959	March 20, 2023
Create speech to text training dataset using text to speech model Intermediate	0	403	February 8, 2023

Please, help me

Related topics