What are the latest Open Source Speech To Text Models with a focus on real-time

Hey, do you know current models that can also be executed locally, i.e. not in the cloud

1 Like

When it comes to locally executable models, the Whisper series seems to have a lot of know-how. However, there are other options as well.

In terms of speed, FastRTC excels in real-time performance, but it’s quite specialized. Or rather, it’s cloud-based?

1 Like

Yes, I already have Whisper on my shortlist and it seems to be the best option. I’ve also heard about

  • Kaldi
  • DeepSpeech
  • Vosk
  • SpeechBrain

Do you have any experience with these?

1 Like

Do you have any experience with these?

No.

1 Like

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.