Wav2Vec2 for Audio Emotion Classification

@Winstead, it would probably solve your problem.
