Issue with Using pyannote/speaker-diarization Gated Model in Colab and API

Hi everyone,

I’m currently working on my engineering thesis, which involves speaker diarization of Polish audio files. I’m trying to use the pyannote/speaker-diarization model for this, but I’ve run into several issues that I haven’t been able to resolve, even after trying various approaches. Here’s a summary of my problem:

  1. Access Granted but Model Not Running:
  • I’ve been granted access to the gated model on Hugging Face. However, when I try to use the model through the Inference API widget on the model’s page, I receive the error: “The API does not support running gated models for community models with framework: pyannote-audio.”
  2. Attempt with Google Colab:
  • I tried running the model myself in Google Colab using pyannote.audio. Despite authenticating successfully with my Hugging Face token, I encountered the following error: “AttributeError: 'NoneType' object has no attribute 'eval'”.
  3. Context of My Work:
  • As part of my thesis, I need to analyze audio files and identify the number of speakers present. I have limited time to complete this, and I’m unsure how to proceed given these obstacles.
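
For reference, the Colab error suggests that something expected to be a loaded model or pipeline was actually None by the time .eval() was called on it. Below is a minimal sketch of a guard that would surface this earlier with a readable message. The helper name load_or_explain is hypothetical (it is not part of pyannote.audio), and the commented usage assumes a pyannote.audio version where Pipeline.from_pretrained accepts use_auth_token; the guess about access conditions in the error message is an assumption, not something I have confirmed.

```python
from typing import Any, Callable, Optional


def load_or_explain(
    loader: Callable[..., Optional[Any]], model_id: str, **kwargs: Any
) -> Any:
    """Call a loader and fail early if it returns None instead of an object.

    `loader` is any callable that takes a model id plus keyword arguments,
    for example a `from_pretrained` class method.
    """
    obj = loader(model_id, **kwargs)
    if obj is None:
        # Assumption: a None result may mean the download was refused,
        # e.g. an invalid token or access conditions not yet accepted
        # for this repository or for a gated model it depends on.
        raise RuntimeError(
            f"Loading '{model_id}' returned None. Check the token and that "
            "access has been accepted for every gated repository involved."
        )
    return obj


# Hypothetical usage in Colab (requires pyannote.audio; not executed here):
# from pyannote.audio import Pipeline
# pipeline = load_or_explain(
#     Pipeline.from_pretrained,
#     "pyannote/speaker-diarization",
#     use_auth_token="hf_...",  # placeholder, not a real token
# )
```

With a guard like this, a failed load would stop at the loading step instead of showing up later as an AttributeError.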

Could you please guide me on:

  • How to properly use the model via the Inference API or another simple method.
  • Resolving the issues I’m encountering in Colab.
  • Any alternative approaches for speaker diarization with Hugging Face models.

I would greatly appreciate any help or suggestions, as this is a critical part of my thesis, and I’m running out of time.

Thank you in advance!

Best regards,
Ola
