Hi everyone,
I’m currently working on my engineering thesis, which involves analyzing speaker diarization for Polish audio files. I’m trying to use the pyannote/speaker-diarization
model to perform this task, but I’ve encountered several issues that I haven’t been able to resolve, even after trying various approaches. Here’s a summary of my problem:
- Access Granted but Model Not Running:
- I’ve been granted access to the gated model on Hugging Face. However, when I try to use the model through the Inference API widget on the model’s page, I receive the error: “The API does not support running gated models for community models with framework: pyannote-audio.”
- Attempt with Google Colab:
- I tried running the model locally using Google Colab and
pyannote.audio
. Despite authenticating successfully with my Hugging Face token, I encountered the following error: “AttributeError: ‘NoneType’ object has no attribute ‘eval’.”
- Context of My Work:
- As part of my thesis, I need to analyze audio files and identify the number of speakers present. I have limited time to complete this, and I’m unsure how to proceed given these obstacles.
Could you please guide me on:
- How to properly use the model via the Inference API or another simple method.
- Resolving the issues I’m encountering in Colab.
- Any alternative approaches for speaker diarization with Hugging Face models.
I would greatly appreciate any help or suggestions, as this is a critical part of my thesis, and I’m running out of time.
Thank you in advance!
Best regards,
Ola