How to use audio augmentations for audio classification

Hi, I'm very new to Hugging Face and audio datasets.

I have an audio dataset in a folder, which I loaded into a `Dataset`:

```
features: ['audio', 'label'],
num_rows: 50
```

Above, `audio` is a dict with the keys `sampling_rate`, `path`, and `array`.
I'm doing audio classification: I used `cast_column` to cast the column to `Audio`, then ran a feature extractor, and then fine-tuned the model. It works fine.

I noticed there is a class imbalance. I used to work with Keras' `ImageDataGenerator`, which provides data augmentation and can help with class imbalance by generating augmented samples.

Can we do something similar with a Hugging Face dataset? Or how can I generate augmented data for the low-count classes?
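For example, I could imagine writing simple waveform-level augmentations myself and applying them to the minority classes with `Dataset.map`, but I don't know if that's the intended approach. A rough sketch of what I have in mind (the function names and the `noise_factor`/`shift` values are just my guesses):

```python
import numpy as np

def add_noise(wave, noise_factor=0.005, seed=0):
    """Add Gaussian noise to a waveform (noise_factor picked arbitrarily)."""
    rng = np.random.default_rng(seed)
    return wave + noise_factor * rng.standard_normal(len(wave))

def time_shift(wave, shift):
    """Circularly shift the waveform by `shift` samples."""
    return np.roll(wave, shift)

# Stand-in for one example's audio["array"] from my dataset
wave = np.zeros(16_000, dtype=np.float32)
noisy = add_noise(wave)
shifted = time_shift(wave, 1_600)
```

I'd then apply these to copies of the minority-class rows (e.g. filter, map, then `concatenate_datasets` with the original) to balance the classes, but I'm not sure whether this is the recommended pattern or whether there is a built-in equivalent.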