Is there any music vocals/voice-to-text model?

lucianohc · July 19, 2023, 10:35pm

Hello everyone!

I had this idea some time ago, about a model that could extract the vocals from a song and then return the lyrics to the user.
The "Split Audio Tracks to MusicGen" demo extracts the vocals pretty well, but I can't save the extracted audio. If I could, I would feed it to another tool to transcribe the vocals into text.

So, my idea is to combine these two steps into one tool. The user uploads the file, or a URL to the song, and the model does its job. Does anyone know if such a tool already exists?
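
To make the idea concrete, here is a rough sketch of how the two steps could be chained. It is only an illustration under some assumptions, none of which come from the demo mentioned above: Demucs (run through its command line) does the vocal separation, a Whisper checkpoint loaded through the 🤗 Transformers `pipeline` does the transcription, and `demucs`, `transformers`, and `ffmpeg` are installed. The function names are made up for this example, and since Whisper is trained mostly on speech, the quality of transcribed singing may vary.

```python
import subprocess
import sys
from pathlib import Path


def demucs_command(song: Path, out_dir: Path, model: str = "htdemucs") -> list[str]:
    """Build the Demucs CLI call that splits a song into vocals / no_vocals."""
    return [
        sys.executable, "-m", "demucs",
        "--two-stems=vocals",  # separate only the vocals from everything else
        "-n", model,           # Demucs model name (assumed default: htdemucs)
        "-o", str(out_dir),    # root folder for the separated stems
        str(song),
    ]


def vocals_path(song: Path, out_dir: Path, model: str = "htdemucs") -> Path:
    """Path where Demucs is expected to write the vocal stem."""
    return out_dir / model / song.stem / "vocals.wav"


def extract_vocals(song: Path, out_dir: Path = Path("separated")) -> Path:
    """Step 1: run Demucs and return the path of the saved vocal track."""
    subprocess.run(demucs_command(song, out_dir), check=True)
    return vocals_path(song, out_dir)


def transcribe(audio: Path, model_id: str = "openai/whisper-small") -> str:
    """Step 2: turn the vocal track into text with a speech-recognition model."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # process long audio in 30-second chunks
    )
    return asr(str(audio))["text"]


def lyrics_from_song(song: Path) -> str:
    """Both steps joined: song file in, transcribed lyrics out."""
    return transcribe(extract_vocals(song))
```

With everything installed, calling `lyrics_from_song(Path("my_song.mp3"))` (a hypothetical file name) would save the vocal stem under `separated/` and return the transcription as a string.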

This is my first post here, sorry for the long text…
Loving this community!

Cheerz

Luciano
