We are working on a thesis project on Podcast Trailer Generation - Hotspot Detection for the Podcast Dataset at Spotify.
The Spotify Podcast Dataset contains both transcripts and audio for many podcast episodes, and we currently plan to use Wav2Vec2 embeddings as input to train an emotion classification model on the audio. The audio is currently English-only (with accompanying transcripts).
It would be much appreciated if you could help with fine-tuning Wav2Vec2 on some standard emotion-annotated audio datasets (e.g. RAVDESS, SAVEE). We will then use the fine-tuned embeddings as input for emotion classification, after which we will run human evaluation of the classification results.
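As a starting point, the fine-tuning setup could look roughly like the sketch below, using Hugging Face `transformers`' `Wav2Vec2ForSequenceClassification` with one label per emotion class (RAVDESS annotates 8 emotions). This is only a minimal sketch: in practice you would call `from_pretrained("facebook/wav2vec2-base", num_labels=...)` and train on real labeled clips, whereas here a tiny randomly initialised config and a dummy waveform keep the example self-contained; the specific class index and config sizes are illustrative assumptions, not project decisions.

```python
import torch
from transformers import Wav2Vec2Config, Wav2Vec2ForSequenceClassification

# RAVDESS annotates 8 emotion classes:
# neutral, calm, happy, sad, angry, fearful, disgust, surprised.
NUM_EMOTIONS = 8

# For the real project you would instead load pretrained weights, e.g.:
#   model = Wav2Vec2ForSequenceClassification.from_pretrained(
#       "facebook/wav2vec2-base", num_labels=NUM_EMOTIONS)
# A tiny random-initialised config keeps this sketch runnable offline.
config = Wav2Vec2Config(
    hidden_size=32,
    num_hidden_layers=2,
    num_attention_heads=2,
    intermediate_size=64,
    num_feat_extract_layers=2,
    conv_dim=(32, 32),
    conv_kernel=(10, 3),
    conv_stride=(5, 2),
    num_labels=NUM_EMOTIONS,
)
model = Wav2Vec2ForSequenceClassification(config)

# One second of 16 kHz audio, standing in for a RAVDESS/SAVEE clip.
waveform = torch.randn(1, 16000)
labels = torch.tensor([3])  # hypothetical class index for this clip

# A single supervised forward pass; outputs.loss is the cross-entropy
# loss you would backpropagate during fine-tuning.
outputs = model(input_values=waveform, labels=labels)
print(tuple(outputs.logits.shape))  # → (1, 8)
```

After fine-tuning, the pooled hidden states (or the logits) can serve as the emotion-aware embeddings for the downstream hotspot-detection stage.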