Pretrain Wav2Vec2 in Dhivehi
Currently, the only Wav2Vec2 model that covers Dhivehi is a multilingually pretrained one. We would like to build a Wav2Vec2 model pretrained only on Dhivehi.
Model
A randomly initialized Wav2Vec2 model
Datasets
- Common Voice: 18 hours in the latest released dataset [32+ hours if the mid-2021 dataset is released in time]
- Podcast data [30 hours]
- others []
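As a rough tally of the data budget listed above, here is a minimal Python sketch. The dictionary keys and the `total_hours` helper are hypothetical names used only for illustration; the hour figures come from the list, and "others" is left out because no figure is given yet. The 32-hour Common Voice figure is a lower bound that only applies if the newer release arrives in time.

```python
# Hours of Dhivehi audio per source, taken from the list above.
# "others" has no figure yet, so it is omitted rather than guessed.
current_hours = {"common_voice": 18, "podcasts": 30}

# Scenario where the mid-2021 Common Voice release is available.
# The list says "32+ hours", so 32 is treated as a lower bound.
with_new_release = {**current_hours, "common_voice": 32}


def total_hours(sources):
    """Return the summed hours over all listed sources."""
    return sum(sources.values())


print("current total:", total_hours(current_hours))
print("with newer Common Voice (at least):", total_hours(with_new_release))
```

Either way the total is in the tens of hours, so any additional scraped audio would be a meaningful share of the pretraining data.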
Available training scripts
FlaxWav2Vec2 will be merged soon (see "[Flax] Add wav2vec2" by patrickvonplaten, Pull Request #12271 in huggingface/transformers), and a pretraining script should be relatively easy to merge as well.
(Optional) Desired project outcome
The best Dhivehi ASR model
(Optional) Challenges
Scraping publicly available Dhivehi audio from various sources