Please, help me

I have voice recordings of the names of the cities in my country. I want to build a dataset from them and use it to train a speech recognition model for my country's cities.
Please help me with this.
I don’t know if I should go to Mars; I can’t find the answer.

Hi! You can create a dataset from your audio files with:

from datasets import Dataset, Features, Audio

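# Declare the "audio" column as an Audio feature so the files are decoded when accessed
features = Features({"audio": Audio()})
# list_of_paths_to_my_audio_files is your own list of file paths (strings)
dataset = Dataset.from_dict({"audio": list_of_paths_to_my_audio_files}, features=features)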

You can also find some documentation about audio dataset processing here: Process audio data — datasets 1.17.0 documentation
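
For a speech recognition model you will also need the city name that goes with each recording. Here is a minimal sketch of one way to build the list of paths together with the labels, using only the standard library. It assumes a folder layout with one sub-folder per city (for example recordings/cairo/rec1.wav); that layout, the function name collect_audio_examples, and the "city" column name are hypothetical, so adapt them to how your files are actually organised.

```python
from pathlib import Path


def collect_audio_examples(root, extensions=(".wav", ".mp3", ".flac")):
    """Return {"audio": [paths], "city": [labels]} for the audio files under root.

    Assumption (hypothetical layout): one sub-folder per city,
    e.g. root/cairo/rec1.wav, so the parent folder name is the label.
    """
    paths, cities = [], []
    # Walk the tree in a stable order so paths and labels stay aligned.
    for path in sorted(Path(root).rglob("*")):
        # Keep only files whose extension looks like audio (case-insensitive).
        if path.is_file() and path.suffix.lower() in extensions:
            paths.append(str(path))
            cities.append(path.parent.name)
    return {"audio": paths, "city": cities}
```

The returned dictionary has the same shape as the one passed to Dataset.from_dict in the snippet above, with an extra "city" column. If you use it, the features would need a matching entry for that column, for example Features({"audio": Audio(), "city": Value("string")}), with Value imported from datasets.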