Can we use Inference Endpoints to take a music file as input and return a music file as output?

Hello everyone, I’m exploring the use of Hugging Face’s Inference Endpoints for a project involving music files. My goal is to input a music file and have the model generate a music file as output. I understand that the feasibility of this would depend on the specific model being used and whether it has been trained to process and generate music files.

However, I’m aware that returning music files directly might not be possible at the moment. As an alternative, I’m considering having the output music file stored in an AWS S3 bucket or Google Cloud Storage after inference. Has anyone implemented something similar, or could anyone provide guidance on how to achieve this? Could you also recommend any specific models for this task?
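A minimal sketch of that alternative, assuming `boto3` is available in the endpoint image and AWS credentials are already configured; `make_object_key` and `upload_and_get_url` are hypothetical helper names, not part of any Inference Endpoints API:

```python
import datetime
import io


def make_object_key(prefix: str = "outputs") -> str:
    """Build a unique, timestamped S3 object key for the generated file."""
    stamp = datetime.datetime.now(datetime.timezone.utc).strftime("%Y%m%dT%H%M%S%f")
    return f"{prefix}/music-{stamp}.wav"


def upload_and_get_url(audio_bytes: bytes, bucket: str, expires: int = 3600) -> str:
    """Upload the generated audio to S3 and return a presigned download URL."""
    import boto3  # assumed to be installed in the endpoint environment

    s3 = boto3.client("s3")
    key = make_object_key()
    s3.upload_fileobj(io.BytesIO(audio_bytes), bucket, key)
    # Presigned URL lets the client download without bucket credentials
    return s3.generate_presigned_url(
        "get_object", Params={"Bucket": bucket, "Key": key}, ExpiresIn=expires
    )
```

The presigned URL approach avoids making the bucket public; the link simply expires after `expires` seconds.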

Thank you in advance for your help.

You can also encode your audio file with base64 and return it as a string, but uploading to S3 and then returning the URL makes more sense, especially for larger files.
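For smaller files, the base64 route can be sketched like this — the endpoint encodes the raw audio bytes into a JSON-safe string, and the client decodes them back (the `audio_b64` field name is just an illustrative choice):

```python
import base64


def encode_audio(audio_bytes: bytes) -> str:
    """Encode raw audio bytes as a base64 string for a JSON response."""
    return base64.b64encode(audio_bytes).decode("utf-8")


def decode_audio(b64_string: str) -> bytes:
    """Decode the base64 string back into raw audio bytes on the client side."""
    return base64.b64decode(b64_string)


# Round trip: what the endpoint returns, and how the client recovers the file
original = b"RIFF\x00\x00\x00\x00WAVE"  # placeholder for real WAV bytes
payload = {"audio_b64": encode_audio(original)}
assert decode_audio(payload["audio_b64"]) == original
```

Note that base64 inflates the payload by roughly a third, which is one reason the S3-and-URL approach scales better for large files.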


Thank you for your response, @philschmid.

I agree that uploading the output music file to a cloud storage service like Amazon S3 or Google Cloud Storage and then returning the URL seems to be a more efficient and practical solution, especially for larger files.
This approach would also provide a more seamless experience for users, as they can directly download the file from the provided URL.

Thanks again for your help!

I was looking for documentation on saving to arbitrary storage (assuming access permissions are controlled by environment variables).

I would like to set environment variables for each endpoint, but I can’t seem to find this in the documentation. Maybe it can’t be done…?

Have you succeeded in your task? Is your model based on the RVC architecture, by any chance?
