Pyannotate pipeline() not working

shantanu2698 · January 6, 2025, 4:04pm

I am using pyannotate for speaker diarization on top of Whisper. However, even though I have provided the token credential, the code consistently gets stuck at the pipeline() call. This eventually leads to the Colab session disconnecting. The video in question is relatively short, only 11:46 minutes. I have attached the code for your reference.

from pyannote.audio import Pipeline
wav_file=“download1.wav”

Load the pretrained diarization pipeline

pipeline = Pipeline.from_pretrained(“pyannote/speaker-diarization”, use_auth_token=“hf_xxxxxxxxxxxxxxxxxxx”
)

Apply the pipeline to the audio fie

diarization = pipeline(wav_file) ######this is where code is getting stuck

Save the diarization output to a file

with open(“diarization.txt”, “w”) as f:
for turn, _, speaker in diarization.itertracks(yield_label=True):
f.write(f"{turn.start:.2f} - {turn.end:.2f}: Speaker {speaker}\n")

print(“Speaker diarization completed.”)

John6666 · January 6, 2025, 6:02pm

I wonder if the Pyannotate settings are wrong…

shantanu2698 · January 8, 2025, 10:20am

I am still stuck at this point. Even though I have been granted model permission, it’s not running and keeps on getting stuck at pipeline(“download1.wav”). I have accepted the user conditions as well. I need urgent help

shantanu2698 · January 8, 2025, 10:21am

Thanks for the reply @John6666 . But the issue remains the same, Even though I have followed all the steps mentioned on the shared post.

John6666 · January 8, 2025, 12:51pm

I thought it was possible that your GPU wasn’t being used. How about this?

# https://github.com/pyannote/pyannote-audio
from pyannote.audio import Pipeline
import torch

hf_token = "hf_xxxxxxxxxxxxxxxxxxx"
device = "cuda" if torch.cuda.is_available() else "cpu"
wav_file="download1.wav"

#Load the pretrained diarization pipeline
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization", use_auth_token=hf_token).to(device) # send pipeline to GPU (when available)
# pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization", device_map="auto", use_auth_token=hf_token) # if your GPU is weak... ensure "pip install -U accelerate" before use it

# Apply the pipeline to the audio fie
diarization = pipeline(wav_file) ######this is where code is getting stuck

#Save the diarization output to a file
with open("diarization.txt", "w") as f:
    for turn, _, speaker in diarization.itertracks(yield_label=True):
       f.write(f"{turn.start:.2f} - {turn.end:.2f}: Speaker {speaker}\n")

print("Speaker diarization completed.")

Navanit-AI · January 9, 2025, 8:50am

The pyannote doesn’t ever starts with 0 is there any reason ?

Alanturner2 · January 9, 2025, 8:58am

It looks like your pipeline() call is hanging. Here are a few things to check:

Check Token: Make sure your Hugging Face token is valid and has the right permissions.
Audio File: Double-check that the audio file path (download1.wav) is correct and accessible. Try with a smaller file to see if that helps.
Colab Resources: Colab has limited resources. You can check the GPU memory usage with:
```
!nvidia-smi
```
If it’s too high, Colab might disconnect.

Debugging: Add print statements to check where the code is stopping:

print("Loading model...")
pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization", use_auth_token="hf_xxxxxxxxxxxxxxxxxxx")
print("Model loaded, applying pipeline...")
diarization = pipeline(wav_file)
print("Diarization completed.")

Run Locally: If it still doesn’t work, try running the code locally or on a different platform (AWS, GCP).

If these steps don’t help, you might want to try a different diarization library like SpeechBrain or pyAudioAnalysis.

Good luck!

Topic		Replies	Views
Issue with Using pyannote/speaker-diarization Gated Model in Colab and API Beginners	3	206	January 9, 2025
Pyannote.speaker_diarization giving 401 Client Error Models	16	8080	July 10, 2025
Pyannote/speaker-diarization - [WinError 1314] A required privilege is not held by the client Beginners	5	7096	June 22, 2024
Pyannote gives wrong results Beginners	2	934	March 23, 2023
Inference Endpoint: `'NoneType' object is not callable`. No other context to go on [Pyannote speaker diarization] Beginners	5	2958	November 13, 2022

Pyannotate pipeline() not working

Load the pretrained diarization pipeline

Apply the pipeline to the audio fie

Save the diarization output to a file

Related topics