Creating a dataset with many utterances per audio file?

Gweltaz · December 16, 2023, 9:53pm

Hi,

I have an audio+transcripts corpus consisting of long audio files. Each audio file has its own metadata file where the time code of each segment is defined along with its transcript.
Is there a way to generate a HF dataset from this structure without having to split the original audio files to single utterance audio files ?
Datasets like AMI (edinburghcstr/ami · Datasets at Hugging Face) and others do have a “begin_time” and “end_time” column, but it doesn’t look that those fields are being used in the dataset script either…

Thanks

Topic		Replies	Views
How does one actually create a new dataset? Beginners	2	3233	October 18, 2024
How to create a dataset for "audio-like" files for ASR Beginners	0	401	April 10, 2023
Misunderstanding around creating audio datasets from Local files 🤗Datasets	12	1754	July 17, 2023
Can Data Files be generated upon dataset load? Beginners	3	453	March 4, 2022
Creating a new dataset Beginners	1	246	February 13, 2024

Creating a dataset with many utterances per audio file?

Related topics