Using Wav2Vec in speech classification/regression problems

darkcurrent · June 12, 2021, 1:58pm

Great work. I see that you created a script that decides whether regression or classification will be used by looking at the “num_labels” value extracted from the CSV files.

I am trying to estimate some neurological scores from sound for Parkinson’s disease patients.

I am going to try to build a six-dimensional regression model. I will give the wav2vec model a WAV file together with six floating-point labels (an array of six elements), and I want the model to predict these labels.

The question is: how should I prepare the CSV files and feed them to the model?
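
To make the question concrete, here is a minimal sketch of one possible CSV layout, with one row per recording and one column per score. The column names (path, score_1 to score_6), the file paths, and the numbers are illustrative assumptions, not the format the existing script expects.

```python
import csv
import io

# Hypothetical CSV layout: one row per recording, one column per score.
# Column names, paths, and values are made up for illustration.
SAMPLE_CSV = """path,score_1,score_2,score_3,score_4,score_5,score_6
audio/patient_001.wav,1.5,0.0,2.25,3.0,0.5,1.0
audio/patient_002.wav,2.0,1.0,0.75,2.5,1.5,0.0
"""

LABEL_COLUMNS = [f"score_{i}" for i in range(1, 7)]


def load_examples(csv_file):
    """Return a list of (wav_path, [six floats]) pairs from an open CSV file."""
    examples = []
    for row in csv.DictReader(csv_file):
        labels = [float(row[name]) for name in LABEL_COLUMNS]
        examples.append((row["path"], labels))
    return examples


# io.StringIO stands in for a real file opened with open("train.csv", newline="").
examples = load_examples(io.StringIO(SAMPLE_CSV))
for wav_path, labels in examples:
    print(wav_path, labels)
```

Each example then pairs a WAV path with a six-element float array, which is the target shape I would like the model to predict.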

And do I need to make any changes to the existing model? You have added a classification layer on top of the model that takes two parameters, config.hidden_size and config.num_labels. I think I have to change these parameters, right?
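
What I have in mind is roughly the following PyTorch sketch of a head with six outputs trained with mean squared error. The class name RegressionHead, the mean pooling over time, the hidden size of 768, and the random tensors standing in for the encoder output are all assumptions on my side, not the actual code from your script.

```python
import torch
from torch import nn


class RegressionHead(nn.Module):
    """Hypothetical head: pools encoder frames and maps them to num_labels values."""

    def __init__(self, hidden_size, num_labels):
        super().__init__()
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.out_proj = nn.Linear(hidden_size, num_labels)

    def forward(self, hidden_states):
        # hidden_states: (batch, time, hidden_size); average over the time axis.
        pooled = hidden_states.mean(dim=1)
        x = torch.tanh(self.dense(pooled))
        return self.out_proj(x)  # (batch, num_labels), no activation for regression


hidden_size = 768  # assumption: stands in for config.hidden_size
num_labels = 6     # stands in for config.num_labels, one output per score

head = RegressionHead(hidden_size, num_labels)

# Random tensors stand in for the wav2vec encoder output and the CSV labels.
hidden_states = torch.randn(2, 50, hidden_size)
targets = torch.randn(2, num_labels)

predictions = head(hidden_states)
loss = nn.MSELoss()(predictions, targets)  # averaged over all six outputs
print(predictions.shape, loss.item())
```

If this is the right idea, only num_labels would change to 6, and the loss would be a regression loss instead of cross-entropy.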

Could you give me some help, please?

Regards,
