I worked out an example where i creating a speaker embedding for my own voice. I recorded some WAV and it kind of worked. However, while using that embedding, the output is like a pure robot. Am i missing something? should there be a particular speech?
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
How to embed Hugging Face Pre-trained models in our own app | 2 | 898 | March 26, 2021 | |
I want train my own model speech recognation localy on my data my voice how to do that I can't find start I need very help | 0 | 355 | December 7, 2021 | |
Can you use the Same embeddings of Wav2Vec XLSR and apply different ASR heads? | 0 | 239 | June 2, 2022 | |
How to extract embeddings in Wav2Vec2? | 0 | 432 | April 29, 2022 | |
Help for using whisper with embeddings | 1 | 424 | November 22, 2023 |