Inference endpoint deployment with a custom Dockerfile

Hello everyone,

I want to create an inference endpoint with a custom dockerfile.
The last two lines of the Dockerfile are:

EXPOSE 7860

CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "7860"]

The deployment failed.
Could you please guide me on how to modify the CMD line correctly, considering I have a single handler.py in the repo? What adjustments should be made to ensure a successful deployment of the inference endpoint with the Dockerfile?

Hi @nurcognizen! Did you find a solution to this problem of creating a custom Dockerfile?

Hi @nurcognizen, I managed to deploy my own custom Docker image. Essentially, your Docker image needs to start a server with a REST API that has at least a /health endpoint and one endpoint for serving your model/logic output. I wrote a short post about how to do this with a simple FastAPI server: https://www.linkedin.com/pulse/how-build-deploy-custom-docker-image-huggingface-sebastian-schramm-guoqe.
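For anyone landing here, a minimal sketch of what such a server could look like (the /predict route name, the request schema, and the placeholder logic are my own assumptions, not the exact code from the post above):

```python
# main.py - minimal FastAPI server for a custom Inference Endpoints image (sketch).
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()


class InferenceRequest(BaseModel):
    # Hypothetical request schema; replace with whatever your model expects.
    inputs: str


@app.get("/health")
def health():
    # Inference Endpoints can probe this route to check that the container is up.
    return {"status": "ok"}


@app.post("/predict")
def predict(request: InferenceRequest):
    # Placeholder logic: load and run your model here and return its output.
    return {"output": request.inputs[::-1]}
```

With this saved as main.py, `uvicorn main:app --host 0.0.0.0 --port 7860` starts the server that the endpoint will talk to.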

You can also take a look at my GitHub repo with a minimal working example: https://github.com/sebastianschramm/fastapi_hf_endpoints (a custom FastAPI server packaged as a Docker image for Hugging Face Inference Endpoints deployment).
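And regarding the original CMD question: a Dockerfile along these lines should work, as long as the quotes and dashes in the CMD are plain ASCII (curly quotes and en-dashes, which the forum inserts outside code blocks, will break the exec-form CMD). The base image and file names here are assumptions; adjust them to your repo:

```dockerfile
# Sketch of a Dockerfile for the FastAPI server above.
FROM python:3.11-slim

WORKDIR /app

# requirements.txt is assumed to list at least fastapi and uvicorn.
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

COPY main.py .

EXPOSE 7860

# Plain double quotes and double hyphens, matching the port exposed above.
CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "7860"]
```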
