Pyannote gives wrong results

laro1 · March 22, 2023, 2:46pm

I’m trying to use pyannote for speaker diarization,
and I’m getting the wrong number of speakers.

Every example I tried gave wrong results.

For example:

I used this YouTube video:
https://www.youtube.com/watch?v=b2_ZZ2UpSzI
I converted it to a WAV file with a sample rate of 16000 Hz.

I ran the following code:

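from pyannote.audio import Pipeline

TEST_FILE = "example.wav"  # 16 kHz WAV converted from the YouTube video
MY_TOKEN = "..."           # Hugging Face access token
# Load the pretrained speaker diarization pipeline
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization",
                                    use_auth_token=MY_TOKEN)

# Run diarization with default settings (no hint about the number of speakers)
diarization = pipeline(TEST_FILE)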

The resulting diarization contains only 2 speakers,

but the ground truth (GT) contains 4 speakers.
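
For reference, here is a minimal sketch of how the detected speakers could be counted from the output. It assumes the pipeline result is a `pyannote.core.Annotation`, whose `itertracks(yield_label=True)` yields `(segment, track, label)` triples. The helper name `speaking_time` and the sample turns are hypothetical and only for illustration. The helper works on plain `(start, end, label)` tuples, so it runs without the model.

```python
from collections import defaultdict


def speaking_time(turns):
    """Sum speaking time (seconds) per speaker from (start, end, label) tuples."""
    totals = defaultdict(float)
    for start, end, label in turns:
        totals[label] += end - start
    return dict(totals)


# With a real pyannote result, the turns could be collected like this
# (assumes `diarization` is the Annotation returned by the pipeline):
# turns = [(seg.start, seg.end, label)
#          for seg, _, label in diarization.itertracks(yield_label=True)]

# Hypothetical turns, for illustration only (not real output for example.wav).
turns = [
    (0.0, 2.5, "SPEAKER_00"),
    (2.5, 4.0, "SPEAKER_01"),
    (4.0, 6.0, "SPEAKER_00"),
]

totals = speaking_time(turns)
print(len(totals), "speakers detected")
for label, seconds in sorted(totals.items()):
    print(f"{label}: {seconds:.1f}s")
```

Comparing the number of labels with the expected speaker count makes the mismatch explicit, and the per-speaker totals show whether a missing speaker was merged into another one.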

How can I tweak pyannote to get better results?
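
One knob that may be worth trying, sketched here under assumptions: the pyannote.audio speaker diarization pipeline accepts `num_speakers`, or `min_speakers` and `max_speakers`, as keyword arguments when it is called. The helper names `speaker_bounds` and `diarize` below are hypothetical, and whether this improves the result for this particular recording has not been verified.

```python
def speaker_bounds(num_speakers=None, min_speakers=None, max_speakers=None):
    """Build the keyword arguments that constrain the number of speakers.

    Hypothetical helper: either an exact count or a min/max range, not both.
    """
    if num_speakers is not None and (min_speakers is not None or max_speakers is not None):
        raise ValueError("pass either num_speakers or min/max bounds, not both")
    bounds = {
        "num_speakers": num_speakers,
        "min_speakers": min_speakers,
        "max_speakers": max_speakers,
    }
    # Keep only the options that were actually set.
    return {name: value for name, value in bounds.items() if value is not None}


def diarize(path, token, **bounds):
    """Run the pretrained pipeline with optional speaker-count constraints."""
    # Imported lazily so the helper above can be used without pyannote installed.
    from pyannote.audio import Pipeline

    pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization",
                                        use_auth_token=token)
    return pipeline(path, **speaker_bounds(**bounds))


# Example: the ground truth has 4 speakers, so request exactly 4.
print(speaker_bounds(num_speakers=4))
# If the count is only roughly known, a range can be given instead.
print(speaker_bounds(min_speakers=2, max_speakers=5))
```

With a valid token, `diarize("example.wav", MY_TOKEN, num_speakers=4)` would then run the same pipeline as above with the speaker count fixed at 4.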

julien-c · March 23, 2023, 9:19am

tagging @hbredin just for visibility

hbredin · March 23, 2023, 9:31am

Duplicate of pyannote gives wrong results · pyannote/pyannote-audio · Discussion #1290 · GitHub
