Hello,
Thank you again for sharing such nice models through this framework.
I am trying to fine-tune a wav2vec2 model on a custom dataset (so not one from the Hugging Face datasets package). I have tried to follow these two tutorials:
- Fine-Tune Wav2Vec2 for English ASR in Hugging Face with 🤗 Transformers
- Fine-tuning with custom datasets — transformers 4.7.0 documentation
but I did not find how to use multiprocessing with the Trainer on a custom dataset. Should I use the DataLoader class from torch? For now I am using the regular Dataset class from torch, as sketched below.
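To make the question more concrete, here is a minimal sketch of what I am currently doing. The file paths, transcripts, and checkpoint name are just placeholders for my real data, and I happen to load audio with soundfile, but that part is not essential:

```python
import soundfile as sf
from torch.utils.data import Dataset
from transformers import Wav2Vec2Processor

# Placeholders standing in for my real custom data
file_paths = ["clip_001.wav", "clip_002.wav"]
transcripts = ["hello world", "good morning"]

class CustomSpeechDataset(Dataset):
    """Plain torch Dataset built from my own files (not the 🤗 datasets package)."""

    def __init__(self, file_paths, transcripts, processor):
        self.file_paths = file_paths
        self.transcripts = transcripts
        self.processor = processor

    def __len__(self):
        return len(self.file_paths)

    def __getitem__(self, idx):
        # Read the raw audio and turn it into input_values with the processor
        speech, sampling_rate = sf.read(self.file_paths[idx])
        input_values = self.processor(speech, sampling_rate=sampling_rate).input_values[0]
        # Tokenize the transcript into label ids for CTC
        labels = self.processor.tokenizer(self.transcripts[idx]).input_ids
        return {"input_values": input_values, "labels": labels}

processor = Wav2Vec2Processor.from_pretrained("facebook/wav2vec2-base-960h")
train_dataset = CustomSpeechDataset(file_paths, transcripts, processor)
# This dataset is then passed to Trainer(..., train_dataset=train_dataset),
# but I do not see where to enable multiple workers for data loading.
```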
I also encountered memory issues on the GPU (16 GB) with a base Wav2Vec2 model, even with a batch size of 1. What is the maximum batch size for a base and a large model on 16 GB, and for what sample length (using fp16)?
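For context, this is roughly the training configuration I am using (the output directory and step counts are placeholders), and it still runs out of memory on the 16 GB card:

```python
from transformers import TrainingArguments

# Roughly what I am running; output_dir and step counts are placeholders
training_args = TrainingArguments(
    output_dir="./wav2vec2-base-custom",
    per_device_train_batch_size=1,  # still goes out of memory at batch size 1
    fp16=True,                      # mixed precision is already on
    num_train_epochs=30,
    save_steps=500,
    logging_steps=50,
)
```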
Thank you for the help!