AI Voice Assistant

Trying to setup a voice assistant. A bit of a n00b and just figuring stuff out, but I am trying to find out if there’s any end-to-end voice solution. if there isn’t, is there a functioning solution design, repo for STT - Wake Up Word - Prompt - Text - TTS.

Pretty much a Voice to Voice solution out there. I’ll appreciate help of any kind;

  • how to think about it, if its possible to solve
  • Link to a repo
  • If it’s currently possible or not.

Would appreciate any help

2 Likes

For methods that can be used with many models, I think the Audio Course article on Hugging Face would be helpful.

Currently, FastRTC is probably the easiest, fastest, and cheapest option. However, it has not been released for very long, so some adjustments may still be necessary.

Thanks a lot! Will spend some time on these.

1 Like