Pyannote gives wrong results

I’m trying to use pyannote for speaker diarization,
but I’m getting the wrong number of speakers.

Every example I tried gave wrong results.

For example, I ran the following code:

from pyannote.audio import Pipeline

TEST_FILE = "example.wav"
MY_TOKEN = "..."
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization", use_auth_token=MY_TOKEN)

diarization = pipeline(TEST_FILE)

And I got the following diarization:
[image: diarization output]

  • The ground truth (GT) contains 4 speakers, not 2.

How can I tweak pyannote to get better results?
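
One thing that may be worth trying, assuming the number of speakers is known in advance (here, 4): the pyannote diarization pipeline accepts num_speakers, min_speakers and max_speakers keyword arguments when it is called. Below is a minimal sketch of passing those hints. The helper names speaker_hints and diarize_with_hints are hypothetical (not part of pyannote), and there is no guarantee this fixes the result for this particular file.

```python
def speaker_hints(num_speakers=None, min_speakers=None, max_speakers=None):
    """Collect only the speaker-count hints that were actually provided."""
    hints = {
        "num_speakers": num_speakers,
        "min_speakers": min_speakers,
        "max_speakers": max_speakers,
    }
    return {name: value for name, value in hints.items() if value is not None}


def diarize_with_hints(pipeline, audio_path, **hints):
    """Call a diarization pipeline, forwarding the known speaker-count hints.

    `pipeline` is assumed to be a callable such as the object returned by
    Pipeline.from_pretrained(...); this helper itself is hypothetical.
    """
    return pipeline(audio_path, **speaker_hints(**hints))


# Usage with the pipeline from the question (not executed here):
# diarization = diarize_with_hints(pipeline, TEST_FILE, num_speakers=4)
# or, if only a range is known:
# diarization = diarize_with_hints(pipeline, TEST_FILE, min_speakers=2, max_speakers=5)
```

After running it, diarization.labels() should list the distinct speaker labels, which makes it easy to compare the detected speaker count against the ground truth.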

Tagging @hbredin just for visibility.

Duplicate of pyannote gives wrong results · pyannote/pyannote-audio · Discussion #1290 · GitHub