Speech language detection using Wave2vec 2.0

I wanted to know whether if we can train Wave2vec 2.0 to detect languages from audio. Like speech to text work only if we know the original language of the audio. Can we detect the language from audio files? Any input will be helpful

Do you mean classifying the language with given audio? How much big is your data? It is an easy task by using a few convolution layer with a few language.

Yes,@EmreOzkose Can you suggest some script or library for classifying the language with the given audio? I couldn’t find any SOTA in this task. I have a large amount of audio data for my task.

You should check this challenge which has 3 task for spoken language detection and nice SOTA approaches. Some of these methods also contains transformer. This means you can use Huggingface.

1 Like