Hello @plexus3d,
You can deploy your custom inference pipeline directly as an Inference Endpoint using a custom container. This means creating a model repository with your Space code and then using a custom Docker image that runs Gradio. Here is an example: philschmid/space-naver-donut-cord · Hugging Face.
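Once the endpoint is running, you call it over plain HTTP. A minimal sketch, assuming an endpoint URL and a JSON payload shape that your container defines (both are placeholders here):

```python
import os

import requests

# Placeholders: replace with your endpoint's URL and a token that can
# access it. The payload schema depends on your container's API.
ENDPOINT_URL = "https://your-endpoint.endpoints.huggingface.cloud"
HF_TOKEN = os.environ["HF_TOKEN"]

response = requests.post(
    ENDPOINT_URL,
    headers={
        "Authorization": f"Bearer {HF_TOKEN}",
        "Content-Type": "application/json",
    },
    json={"inputs": "your input here"},
)
response.raise_for_status()
print(response.json())
```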
Additionally, as you mentioned, you can create a custom inference handler. The documentation includes several examples of how to do this (a minimal handler sketch follows the list):
- Optimum and ONNX Runtime
- Diffusers with stable-diffusion
- Image Embeddings with BLIP
- TrOCR for OCR Detection
- Optimized Sentence Transformers with Optimum
- Pyannote Speaker diarization
- LayoutLM
- Flair NER
- GPT-J 6B Single GPU
- Donut Document understanding
- SetFit classifier
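For the custom handler route, the repository needs a handler.py at its root that defines an EndpointHandler class. A minimal sketch, assuming a transformers pipeline as the model; the task and pre/post-processing are placeholders you would replace with your own pipeline:

```python
from typing import Any, Dict, List

from transformers import pipeline


class EndpointHandler:
    def __init__(self, path: str = ""):
        # `path` points to the repository files; load your model once here.
        # The task below is a placeholder for your actual pipeline.
        self.pipeline = pipeline("text-classification", model=path)

    def __call__(self, data: Dict[str, Any]) -> List[Dict[str, Any]]:
        # Inference Endpoints pass the request body as a dict with an
        # "inputs" key and optional "parameters".
        inputs = data["inputs"]
        parameters = data.get("parameters", {})
        return self.pipeline(inputs, **parameters)
```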
Or you can simply call the API that your Space already exposes.
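Every Gradio Space exposes an API that you can call with the gradio_client package. A short sketch; the Space name and endpoint signature are placeholders, so check the "Use via API" link on your Space for the exact names and argument order:

```python
from gradio_client import Client

# Placeholder Space name; replace with your own user/space.
client = Client("your-username/your-space")
result = client.predict("your input here", api_name="/predict")
print(result)
```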