Speaker diarization with Whisper?

rajistics · January 31, 2023, 3:50pm

Any suggestions for speaker diarization with Whisper? Is pyannote the way to go, or are there other alternatives?

sanchit-gandhi · January 31, 2023, 4:28pm

There’s support for Whisper + pyannote speaker diarization in Speechbox (the huggingface/speechbox repository on GitHub).

In my experience, the pre-trained pyannote models work very well, but there’s the option of fine-tuning these models too.

Any fine-tuned Whisper or pyannote model can be dropped directly into the Speechbox pipeline.
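To give a rough idea of what combining the two models involves, here is a minimal sketch of the alignment step: the diarizer yields speaker turns with timestamps, Whisper yields text chunks with timestamps, and each chunk is labelled with the speaker whose turn overlaps it most. This is an illustration under assumptions, not Speechbox's actual implementation. The function names (`overlap`, `assign_speakers`), the tuple layout, and the sample data are all hypothetical, and the max-overlap rule is just one simple way to do the matching.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(turns, chunks, unknown="UNKNOWN"):
    """Label each transcript chunk with the speaker whose turn overlaps it most.

    turns:  list of (start, end, speaker) tuples, e.g. from a diarization model
    chunks: list of (start, end, text) tuples, e.g. from timestamped ASR output
    Chunks that overlap no turn at all get the `unknown` label.
    """
    labelled = []
    for c_start, c_end, text in chunks:
        best_speaker, best_overlap = unknown, 0.0
        for t_start, t_end, speaker in turns:
            shared = overlap(t_start, t_end, c_start, c_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labelled.append((best_speaker, text))
    return labelled


# Hypothetical sample data (made up for illustration, not real model output).
turns = [(0.0, 4.0, "SPEAKER_00"), (4.0, 9.0, "SPEAKER_01")]
chunks = [
    (0.0, 3.5, "Hello there."),
    (3.5, 8.0, "Hi, how are you?"),
    (9.5, 10.0, "Bye."),
]

for speaker, text in assign_speakers(turns, chunks):
    print(f"{speaker}: {text}")
```

In practice the turns and chunks would come from the diarization and ASR models respectively; the toy version above only shows how the two timelines can be merged.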
