(Audio-to-audio models) Should I use two models sequentially or train one model to build a music-to-music converter?

I had the idea of creating a service that takes a song in one genre and transforms it into a song with the same lyrics but a different genre and instrumental.

I was wondering whether it would make more sense to first use an audio-to-text model to get the lyrics and then feed them into one of the text-to-music models to create the song in the new genre, or whether I should attempt to train a new audio-to-audio model that does both tasks at once.
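
For concreteness, here's a rough sketch of what I mean by the two-model approach. Whisper and MusicGen are just placeholder choices, and the file names and genre prompt are made up; MusicGen in particular only generates instrumental audio, so the real pipeline would need a text-to-music model that can actually sing the transcribed lyrics.

```python
# Sketch of the two-model pipeline: transcribe the lyrics, then prompt a
# text-to-music model with the lyrics plus a target-genre description.
# Model choices (Whisper, MusicGen) and file names are placeholders.
import whisper
import scipy.io.wavfile
from transformers import AutoProcessor, MusicgenForConditionalGeneration

# Step 1: audio -> text (lyric transcription)
asr = whisper.load_model("base")
lyrics = asr.transcribe("original_song.mp3")["text"]

# Step 2: text -> music, steered toward the target genre
processor = AutoProcessor.from_pretrained("facebook/musicgen-small")
musicgen = MusicgenForConditionalGeneration.from_pretrained("facebook/musicgen-small")

prompt = f"a country ballad with these lyrics: {lyrics}"
inputs = processor(text=[prompt], padding=True, return_tensors="pt")
audio = musicgen.generate(**inputs, max_new_tokens=512)

# Write the generated waveform to disk
rate = musicgen.config.audio_encoder.sampling_rate
scipy.io.wavfile.write("converted_song.wav", rate=rate, data=audio[0, 0].numpy())
```

The alternative would be a single audio-to-audio model trained end to end on (original song, genre-converted song) pairs, which skips the intermediate lyrics step entirely but needs paired training data I don't currently have.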