Common Voice dataset 15.0 version release

Hi All and especially @reach-vb!
I need some help with the Common Voice dataset. The Georgian dataset has recently grown to 154 hours in total (Common Voice). I am exploring options for training an STT model. I have seen version 13.0 of the Common Voice dataset on Hugging Face, and I was wondering whether 15.0 is in progress and will be available soon.

I also see the option of loading the dataset locally, similar to this thread:

But I am wondering whether the latest dataset version will be added soon.
I would try to open a PR, but the 15.0 dataset repo needs to be created first, as was done for 13.0 (mozilla-foundation/common_voice_13_0).
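
For context, here is a rough sketch of how the 13.0 release can be loaded today with the `datasets` library, and what the repo id for 15.0 would look like if it follows the same naming pattern as 13.0. The helper names `common_voice_repo_id` and `load_common_voice` are made up for illustration, the 15.0 repo id is only a guess based on the 13.0 name, and `"ka"` is assumed to be the config name for Georgian.

```python
def common_voice_repo_id(version: str) -> str:
    """Build a Hub repo id following the pattern of
    mozilla-foundation/common_voice_13_0 (hypothetical helper)."""
    major, minor = version.split(".")
    return f"mozilla-foundation/common_voice_{int(major)}_{int(minor)}"


def load_common_voice(version: str, language: str = "ka", split: str = "train"):
    """Load one language split from the Hub (hypothetical helper).

    Needs `pip install datasets`. "ka" is assumed to be the Georgian config.
    """
    # Imported lazily so the repo id helper works without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(common_voice_repo_id(version), language, split=split)


print(common_voice_repo_id("13.0"))  # existing repo mentioned above
print(common_voice_repo_id("15.0"))  # assumed name; the repo does not exist yet
```

Calling `load_common_voice("13.0")` should work for the existing release, though it may require accepting the dataset terms on the Hub and being logged in, and depending on the installed `datasets` version extra arguments may be needed. The same call with `"15.0"` would only work once that repo has been created.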



@reach-vb should be able to help with that.