Hindi ASR: Fine-Tuning Wav2Vec2

hadarishav · March 18, 2021, 11:32am

Hi guys! I am planning to work on fine-tuning XLSR-Wav2Vec2 in Hindi. Would love to collaborate with others working on it too. We can share and discuss issues here.

danurahul · March 18, 2021, 5:02pm

hii @hadarishav i am also working on it tooo.

gchhablani · March 20, 2021, 4:57am

Hi @danurahul, @hadarishav

I don’t think common voice has ample amount of Hindi data, the given data when trained doesn’t lead to any reduction in the WER.

Let me know if there are any other datasets that you find

Thanks,
Gunjan

danurahul · March 20, 2021, 10:46am

https://drive.google.com/drive/folders/1s5bAozkso28BLCvwG0jE2T0Oj_ONEejv

check this data its very huge

gchhablani · March 20, 2021, 1:27pm

Hi @danurahul,

Thanks a lot, I’ll look into it.

Do you have a LICENSE to use this data? or a paper you can cite?

danurahul · March 21, 2021, 3:14am

Its freely availablr for competition

shiwangi27 · March 22, 2021, 7:06pm

Joining a little late to the party but would love to collaborate too for Hindi ASR.

gchhablani · March 22, 2021, 8:07pm

Has anyone fine-tuned on a model on Hindi yet?

hadarishav · March 22, 2021, 8:23pm

@danurahul Thanks for sharing the resource. This looks promising.
@gchhablani You can find more details on the dataset here - GitHub - Speech-Lab-IITM/Hindi-ASR-Challenge: 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
They make the dataset freely available for research. So I think, we can use it safely!
I have started working today. Reading the docs and other instructions. Will get to the code soon. Will start with exploring the data first. How about you?

niksarrow · March 22, 2021, 9:58pm

Does anyone know of a code-snippet to load the dataset from a drive for use in this fine-tuning task? Much appreciated!!

theainerd · March 23, 2021, 1:53am

Hello ,

I am working on Hindi as well. Currently I am able to achieve 0.54 validation WER as well as 72 percent WER on test dataset. I will be updating the full info soon.

hadarishav · March 23, 2021, 8:24am

@theainerd Which dataset are you using?

theainerd · March 23, 2021, 2:24pm

I am using Open SLR dataset. I tried training on common voice but the dataset is too small and test WER remain 1 only.

skylord · March 23, 2021, 11:49pm

Does anyone have access to this database:

https://www.iitm.ac.in/donlab/tts/database.php

It seems they do not give access unless you are affiliated to an institute.

gchhablani · March 24, 2021, 1:06pm

Hi,

Look up mounting drive on Colab
If you’re using a local setup, then I’m not fully sure.

gchhablani · March 24, 2021, 1:07pm

Which Open SLR dataset? Can you share the link here please? Is it the INTERSPEECH 2021 one?

raikarsagar · March 24, 2021, 1:23pm

Hi, joined late… Would like to support others working on finetuning hindi ASR
-Sagar

nalli212 · July 7, 2021, 5:12am

hey, I’m working in the ASR domain. let me know how to get in touch so we can collaborate!

testingemailst · December 10, 2021, 1:35am

how to do that trained huggingface model speech recognation on my own dataset? how i can start ? i don’t know the structure of the dataset? help… very help
how I store voice and how to lik with its text how to orgnize that
I an looking for any one help me in this planet
Should I look for the answer in Mars?

Harveenchadha · January 4, 2022, 4:44pm

Do check my ASR model of hindi finetuned on 4000 hours of high quality data.

Topic		Replies	Views
Kannada ASR: Fine-Tuning Wav2Vec2 Languages at Hugging Face	1	1048	March 22, 2021
Tamil ASR: Fine-Tuning Wav2Vec Languages at Hugging Face	2	649	April 1, 2021
Indonesian ASR: Fine-Tuning Wav2Vec2 Languages at Hugging Face	35	2566	March 1, 2023
Assamese ASR: Finetuning Wav2Vec2 Languages at Hugging Face	0	524	March 28, 2021
Marathi ASR: Fine-Tuning Wav2Vec2 Languages at Hugging Face	2	594	March 24, 2021

Hindi ASR: Fine-Tuning Wav2Vec2

Related topics