What do 'min_duration_off' and 'threshold' mean (segmentation)?

laro1 · March 29, 2023, 1:14pm

Printing the pipeline parameters of pyannote.audio (speaker-diarization)
with pipeline.parameters(instantiated=True) gives:

{
    'segmentation': {
        'min_duration_off': 0.5817029604921046,
        'threshold': 0.4442333667381752
    },
    'clustering': {
        'method': 'centroid',
        'min_cluster_size': 15,
        'threshold': 0.7153814381597874
    }
}

I read the paper on the segmentation model (End-to-end speaker segmentation for overlap-aware resegmentation)
and still don’t understand: what do min_duration_off and threshold mean?

Musador13 · September 19, 2023, 12:02am

min_duration_on: speech regions shorter than this many seconds are removed.
min_duration_off: non-speech regions (gaps) shorter than this many seconds are filled, so the speech on either side is merged.
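
To make the two rules above concrete, and to illustrate what threshold most likely does, here is a minimal plain-Python sketch. It does not use pyannote.audio itself. It assumes that threshold is the cut-off applied to the segmentation model's per-frame speaker activity scores (a frame counts as speech when its score is above the threshold), and that gaps are filled before short regions are removed. The function names (binarize, fill_short_gaps, drop_short_regions), the frame duration and the scores are all made up for illustration.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start, end) in seconds


def binarize(scores: List[float], frame_duration: float, threshold: float) -> List[Segment]:
    """Turn per-frame activity scores into speech regions (assumed role of 'threshold')."""
    segments: List[Segment] = []
    start = None
    for i, score in enumerate(scores):
        active = score > threshold
        if active and start is None:
            start = i * frame_duration          # speech region begins
        elif not active and start is not None:
            segments.append((start, i * frame_duration))  # speech region ends
            start = None
    if start is not None:                       # still active at the last frame
        segments.append((start, len(scores) * frame_duration))
    return segments


def fill_short_gaps(segments: List[Segment], min_duration_off: float) -> List[Segment]:
    """Merge neighbouring regions separated by a gap shorter than min_duration_off."""
    merged: List[Segment] = []
    for start, end in sorted(segments):
        if merged and start - merged[-1][1] < min_duration_off:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def drop_short_regions(segments: List[Segment], min_duration_on: float) -> List[Segment]:
    """Remove speech regions shorter than min_duration_on."""
    return [(s, e) for s, e in segments if e - s >= min_duration_on]


# Hypothetical scores for one speaker, one value per 0.5 s frame.
scores = [0.9, 0.9, 0.1, 0.8, 0.8, 0.8, 0.2, 0.2, 0.2, 0.2, 0.6, 0.1]

raw = binarize(scores, frame_duration=0.5, threshold=0.5)
filled = fill_short_gaps(raw, min_duration_off=1.0)
final = drop_short_regions(filled, min_duration_on=1.0)

print(raw)     # three regions, one of them only 0.5 s long
print(filled)  # the 0.5 s gap between the first two regions is filled
print(final)   # the isolated 0.5 s region is removed
```

With these toy values, a lower threshold would mark more frames as speech, and a larger min_duration_off would merge regions across longer pauses. The 'threshold' under 'clustering' is a separate parameter and is not covered by this sketch.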
