I’m new to machine learning and to Hugging Face, so please tell me if I should be asking this question in a different place or in a different way. (I’m only allowed two links, so I’ve had to disable some links with “-dot-”.)
On two website for making video animations, Kreado (KreadoAI-dot-com) and [HeyGen (HeyGen-dot-com), there is a selection of text-to-speech voices. The user inputs a script in text and selects a voice, and the software generates the sound of that voice speaking the script. On both websites, the user has to be logged in to a free account in order to see the list of voices available. When one is logged in, Kreado offers 31 voices at KreadoAI-dot-com/ai/dubbing, while HeyGen has 92 at app.HeyGen-dot-com/voices. Each voice is identified by a unique name, such as Christopher and Elizabeth.
With so many voices, it can be hard to tell the difference between one and another. However, there is one voice that is very different from all the others, but seems to be the same voice on both sites. It is the voice of a little girl, and both sites call her Ana. For those people here who don’t have accounts on those sites, I’ve used the Windows Game Bar to save the samples each site provides of Ana’s voice:
It’s possible that both websites set their parameters for generating the voice of a little girl so similarly that their voices come out sounding the same, and it’s possible that both sites just happened to give their little-girl voices as well as 15 other voices the same names as the other site, but that seems highly unlikely. What is more likely is that both sites have obtained these voices from another source, or perhaps Kreado is licensing its smaller selection of voices from HeyGen.
I want to know the common source of these voices so that I can work directly with the technology instead of an implementation. I’m so new here that I don’t know if I’m asking about a model or a dataset or a space or something else. But is there a way to find the common source of those two voices?