Currently, the only pretrained Wav2Vec2 model that covers Russian is a multilingual one. Let’s pretrain a Wav2Vec2 model on Russian alone.
The model will be trained on Russian speech.
A randomly initialized Wav2Vec2 model.
We can use the run_wav2vec2_pretrain_flax script to train the model.
The dataset contains many hours of audio, so training will take a long time.
The best Russian ASR model.