This category is for questions about Inference Endpoints, our production inference solution for easily deploying machine learning models hosted on the Hub.
Hello!
I'm making a mobile app for text-to-image and I'm considering your Inference Endpoints, but the pricing isn't clear to me. If I create one endpoint for one model, it serves only that model, right? I mean, if I want the app to let users switch between a couple of different models, do I need to create separate endpoints for those models and pay for each of them separately?
Thanks!
Okay, I found that I can have a multi-model endpoint. Very nice post - Multi-Model GPU Inference with Hugging Face Inference Endpoints
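For anyone landing here later, the multi-model pattern from that post boils down to a custom handler that loads every pipeline once and lets the request payload pick one. This is only a minimal sketch of that dispatch idea: the class/field names (`EndpointHandler`, a `"model"` key in the payload) and the stub functions standing in for real diffusers pipelines are assumptions for illustration, not the exact code from the post.

```python
# Sketch of multi-model dispatch inside one endpoint: load each pipeline
# once at startup, then route each request by a "model" field in the payload.

def sd_pipeline(prompt):
    # stub standing in for a loaded text-to-image pipeline
    return f"sd image for: {prompt}"

def sdxl_pipeline(prompt):
    # stub standing in for a second loaded pipeline
    return f"sdxl image for: {prompt}"

class EndpointHandler:
    def __init__(self):
        # load every model once and keep the pipelines in a dict
        self.pipelines = {"sd": sd_pipeline, "sdxl": sdxl_pipeline}

    def __call__(self, data):
        # the request chooses the model; default to "sd" if omitted
        name = data.get("model", "sd")
        if name not in self.pipelines:
            raise ValueError(f"unknown model: {name}")
        return self.pipelines[name](data["inputs"])

handler = EndpointHandler()
print(handler({"inputs": "a cat", "model": "sdxl"}))
```

The upside is paying for one endpoint; the trade-off is that all models must fit in that endpoint's memory at once.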
What about callbacks for text-to-image generation - can I track progress? (I want to show it in the app, so the user would know the approximate generation time.) Is that possible? I saw there are Webhooks that you can create on a Space, but I'm not sure if that's the best/only way?
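On the progress question: diffusion pipelines run a fixed number of denoising steps, so progress is just steps done divided by total steps, and diffusers pipelines accept a per-step callback that can publish it. Below is only a sketch of the callback's arithmetic; the `report()` helper is hypothetical (in a real endpoint it would push the percentage to the app, e.g. over server-sent events), and the loop simulates the pipeline invoking the hook.

```python
# Sketch: per-step progress reporting for a diffusion run.
# report() is a hypothetical stand-in for whatever transport
# (SSE, websocket, polling endpoint) sends progress to the client.

progress_log = []

def report(percent):
    # hypothetical: in a real endpoint, send this to the mobile app
    progress_log.append(percent)

def on_step_end(step, total_steps):
    # the per-step hook: compute and publish percent complete
    report(round(100 * (step + 1) / total_steps))

# simulate a 4-step generation invoking the hook once per step
total = 4
for step in range(total):
    on_step_end(step, total)

print(progress_log)  # [25, 50, 75, 100]
```

Since endpoints are request/response, the app would still need some channel to receive these updates; a Space with Webhooks is one option, but not the only conceivable one.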