Speech language detection using Wave2vec 2.0

Parth · March 24, 2021, 5:11am

I wanted to know whether if we can train Wave2vec 2.0 to detect languages from audio. Like speech to text work only if we know the original language of the audio. Can we detect the language from audio files? Any input will be helpful
@valhalla

EmreOzkose · March 24, 2021, 5:36am

Do you mean classifying the language with given audio? How much big is your data? It is an easy task by using a few convolution layer with a few language.

Parth · March 24, 2021, 5:40am

Yes,@EmreOzkose Can you suggest some script or library for classifying the language with the given audio? I couldn’t find any SOTA in this task. I have a large amount of audio data for my task.

EmreOzkose · March 24, 2021, 5:58am

You should check this challenge which has 3 task for spoken language detection and nice SOTA approaches. Some of these methods also contains transformer. This means you can use Huggingface.

Topic		Replies	Views
Wav2Vec2 for Audio Emotion Classification 🤗Transformers	6	8191	May 26, 2021
Using Wav2Vec in speech classification/regression problems Languages at Hugging Face	13	9615	November 16, 2022
Wav2vec2-XLS-R Language Identification downstream task weights Community Calls	0	946	March 31, 2022
Wav2vec For Music Applications (generation, captioning, instrument classification) Flax/JAX Projects	2	1506	July 3, 2021
Model for Audio classification 🤗Transformers	2	1198	January 23, 2023

Speech language detection using Wave2vec 2.0

Related topics