Hi guys! I am planning to work on fine-tuning XLSR-Wav2Vec2 in Hindi. Would love to collaborate with others working on it too. We can share and discuss issues here.
hii @hadarishav i am also working on it tooo.
I don’t think common voice has ample amount of Hindi data, the given data when trained doesn’t lead to any reduction in the WER.
Let me know if there are any other datasets that you find
check this data its very huge
Thanks a lot, I’ll look into it.
Do you have a LICENSE to use this data? or a paper you can cite?
Its freely availablr for competition
Joining a little late to the party but would love to collaborate too for Hindi ASR.
Has anyone fine-tuned on a model on Hindi yet?
@danurahul Thanks for sharing the resource. This looks promising.
@gchhablani You can find more details on the dataset here - GitHub - Speech-Lab-IITM/Hindi-ASR-Challenge: 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
They make the dataset freely available for research. So I think, we can use it safely!
I have started working today. Reading the docs and other instructions. Will get to the code soon. Will start with exploring the data first. How about you?
Does anyone know of a code-snippet to load the dataset from a drive for use in this fine-tuning task? Much appreciated!!
I am working on Hindi as well. Currently I am able to achieve 0.54 validation WER as well as 72 percent WER on test dataset. I will be updating the full info soon.
@theainerd Which dataset are you using?
I am using Open SLR dataset. I tried training on common voice but the dataset is too small and test WER remain 1 only.
Does anyone have access to this database:
It seems they do not give access unless you are affiliated to an institute.
Look up mounting drive on Colab
If you’re using a local setup, then I’m not fully sure.
Which Open SLR dataset? Can you share the link here please? Is it the INTERSPEECH 2021 one?
Hi, joined late… Would like to support others working on finetuning hindi ASR