I am new to HF Spaces. Here is the project I’m imagining:
I’m working on an iOS app that takes a hand-drawn image, passes it through ControlNet, and gets back a diffusion-generated image to print out. The goal would be to bring this to a group of kids, let them play with diffusion, and give each of them a printout of their own work enhanced by it.
I know how to run ControlNet in a Colab notebook. I am wondering what the simplest translation of this to an HF Space would be. I don’t necessarily need a frontend for the experience - I think we will build that in iOS - so I don’t think a Streamlit or Gradio frontend makes sense. Really I just want a persistent server that can act as an endpoint for ControlNet (effectively the code from a Python notebook) for an iOS app. I see the word Docker and I flinch (“is Docker overkill for this?”), but is that the right way to use HF Spaces if what I effectively want is to turn a Colab notebook into a persistent endpoint I can call from, say, an iOS app?
Spaces are designed for demos and shareable ML apps. You might be interested in our Inference Endpoints solution, which provides exactly what you’re looking for: a dedicated, fully managed inference server. You may need to convert your Jupyter notebook into custom Python handler code, and with a few clicks you can deploy it and have an API endpoint.
Luckily @philschmid wrote a really comprehensive tutorial about deploying ControlNet as an Inference Endpoint, so you can just copy and adapt his handler.py.
You can also see multiple model templates on our Hub.
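In case it helps to see the shape of it, here is a rough sketch of what a custom handler.py for a ControlNet Inference Endpoint can look like. This is not Phil’s exact handler: the model IDs and the request/response contract (base64 drawing plus prompt in, base64 PNG out) are assumptions you’d adapt from your Colab code and his tutorial.

```python
import base64
from io import BytesIO
from typing import Any, Dict

import torch
from PIL import Image
from diffusers import (
    ControlNetModel,
    StableDiffusionControlNetPipeline,
    UniPCMultistepScheduler,
)


class EndpointHandler:
    """Custom handler interface expected by Inference Endpoints."""

    def __init__(self, path: str = ""):
        # Example model IDs -- swap in whichever ControlNet variant you used in Colab.
        controlnet = ControlNetModel.from_pretrained(
            "lllyasviel/sd-controlnet-scribble", torch_dtype=torch.float16
        )
        self.pipe = StableDiffusionControlNetPipeline.from_pretrained(
            "runwayml/stable-diffusion-v1-5",
            controlnet=controlnet,
            torch_dtype=torch.float16,
        )
        self.pipe.scheduler = UniPCMultistepScheduler.from_config(self.pipe.scheduler.config)
        self.pipe.to("cuda")

    def __call__(self, data: Dict[str, Any]) -> Dict[str, str]:
        # Assumed payload: {"inputs": {"image": "<base64 drawing>", "prompt": "..."}}.
        inputs = data.get("inputs", data)
        drawing = Image.open(BytesIO(base64.b64decode(inputs["image"]))).convert("RGB")
        prompt = inputs.get("prompt", "a colorful illustration")

        result = self.pipe(prompt, image=drawing, num_inference_steps=20).images[0]

        buffer = BytesIO()
        result.save(buffer, format="PNG")
        return {"image": base64.b64encode(buffer.getvalue()).decode("utf-8")}
```

Your iOS app (or a quick test script) would then POST that JSON payload to the endpoint URL with your HF token in the Authorization header.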
Thank you! I think this is exactly what we’re looking for.
I will look further, but two quick questions:
1. I know HF provides grants for Spaces. Are there grants for Endpoints? The end goal of this is an educational demo (exploring ControlNet with kids: draw a picture and pass it through ControlNet). And if not –
2. I will look into this, but I think the main things we will need uptime for are a) testing and b) actual use with kids. I assume that’s easy to manage (turning an endpoint on and off)?
I’m not sure about grants for Inference Endpoints, since it’s a production-ready service. cc @philschmid
On another note, we’re going to release the image-to-image task, which includes ControlNet, in our Inference API. It offers free access with rate limits and higher limits for Pro users. This service is not optimized for high loads or fast inference, but it could be a good start.
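If you do start with that serverless route, calling it from a quick test script (before wiring up the iOS client) follows the usual bearer-token pattern of the Inference API. The exact payload for the image-to-image/ControlNet task isn’t spelled out in this thread, so treat the body below as a placeholder to adapt from the task docs; it assumes raw image bytes in and generated image bytes back.

```python
import requests

# Example model ID -- pick whichever ControlNet checkpoint you want to query.
API_URL = "https://api-inference.huggingface.co/models/lllyasviel/sd-controlnet-scribble"
headers = {"Authorization": "Bearer hf_xxx"}  # your HF (or Pro) access token

# Assumed request/response format: raw PNG bytes in, image bytes out.
with open("drawing.png", "rb") as f:
    response = requests.post(API_URL, headers=headers, data=f.read())

with open("output.png", "wb") as f:
    f.write(response.content)
```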
Hello again @radames!
I have now looked at the blog post you shared - which is exactly what we are looking for, an endpoint for ControlNet.
For pricing an endpoint - is it pay-per-use, or is it billed for as long as the endpoint is up? I ask because the suggested GPU option from Phil’s blog post, “GPU medium”, is estimated at $900/month, and “GPU small” at ~$400/month.
I think if our project really got going we’d love to pay for a full-time endpoint, but for now we are just in the phase of building an iOS frontend to diffusion models so that we can run a few initial workshops for kids on drawing and diffusion and see if there really is something there. (I think we’d be using the endpoint a few hours at a time at most.)
Do you have any suggestions – perhaps a grant we could apply for, like the grants for HF Spaces – or a lower-cost way to provision an endpoint with a GPU?
Thank you!!
It seems that the endpoint incurs cost for the whole time it is up, not only for the time spent computing. How did you manage this so that you get both low cost and a decent user experience?
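For reference, the usual way to keep costs down for occasional workshops is to pause the endpoint between sessions, since a paused endpoint stops accruing charges. Newer versions of huggingface_hub expose this from Python; the endpoint name below is a placeholder, and it’s worth confirming these helpers exist in your installed version.

```python
from huggingface_hub import HfApi

api = HfApi(token="hf_xxx")  # token with access to your Inference Endpoints

# Resume the endpoint shortly before a workshop and wait until it is ready...
endpoint = api.resume_inference_endpoint("controlnet-workshop")  # placeholder name
endpoint.wait()

# ...then pause it afterwards so you are not billed for idle GPU time.
api.pause_inference_endpoint("controlnet-workshop")
```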