Guidelines for using a Custom Docker Image

Hi!

I am trying to use Inference Endpoints for model deployment in production. I have already succeeded in deploying a model using the Default Container type. This means that I already have all the necessary setup, including a custom handler.py, working.

The issue is that the necessary final step involves installing a given package that is private, which means it cannot be installed from requirements.txt. I saw that one can specify a custom Docker image, which would easily solve this issue. However, I have been trying and have never been able to initialize the Endpoint. It does not even show any logs.

What should be the content of the Dockerfile? I was assuming something like this:

FROM <my_docker_image>

# WORKDIR
WORKDIR /repository
ADD . /repository

# EXPOSE PORT XXX
EXPOSE XXX

CMD ["python", "handler.py"]

If I should provide any additional information, I will be happy to share it.

Thanks for your help.

I’m not sure if HF has support for private packages. What kind of model are you trying to deploy?

It could be an error due to Hugging Face not being able to access the Docker image. Can you verify that it did?

@grim-metal It will only support them if you provide a custom Docker image. I am deploying a YOLO model, but I will need a set of private utils that I would like to bake into the Docker image. Apart from making that possible, running the endpoint from a custom Docker image provides another powerful advantage: you don't have to install custom dependencies from requirements.txt every time a new replica comes up. That installation consumes a significant amount of time, which hurts autoscaling speed. I need to upgrade PyTorch to v2, which means that installing it and the other requirements takes almost 3-5 min to complete. A sketch of the kind of Dockerfile I have in mind is below.

@David394 I am almost sure that the Docker Hub credentials are properly set up, but I will double-check that as well.
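For illustration, a minimal sketch of that idea, assuming the private package can be shipped as a pre-built wheel (the base image tag, wheel filename, and paths here are hypothetical):

# Hypothetical base image; pick whatever Python/CUDA combination you need
FROM python:3.10-slim

WORKDIR /repository

# Install the heavy public dependencies at build time so new replicas
# do not pay the 3-5 min installation cost on start-up
RUN pip install --no-cache-dir "torch>=2.0"

# Copy the private package in as a pre-built wheel and install it locally,
# so no private index or credentials are needed at runtime
COPY wheels/my_private_utils-0.1.0-py3-none-any.whl /tmp/
RUN pip install --no-cache-dir /tmp/my_private_utils-0.1.0-py3-none-any.whl

# Copy the rest of the repository (handler.py, model assets, ...)
COPY . /repository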

If it’s a YOLO model, and you want it in a container behind a REST API, you could try using Modelbit instead. Private packages | Modelbit Documentation

@alex-bronze Were you able to figure out the contents of the custom Dockerfile?

Hi @amosyou!

Unfortunately I was not able to figure out the details… I was having issues with the autoscaling feature, so I decided to give up and move to AWS SageMaker Endpoints…

Anyway, if you find the details, would you mind sharing them?

Thanks!

Hi @alex-bronze, not sure if this is helpful to you, but I managed to deploy my own custom Docker image. Essentially, your Docker image needs to start a server with a REST API that has at least a /health endpoint and one endpoint for serving your model/logic output. I wrote a short post about how to do this with a simple FastAPI server: https://www.linkedin.com/pulse/how-build-deploy-custom-docker-image-huggingface-sebastian-schramm-guoqe.
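To make that concrete, here is a minimal sketch of such a server; the prediction logic is a placeholder, and the route name /predict and the request fields are hypothetical choices, not a fixed API:

# app.py - minimal FastAPI server for a custom Inference Endpoints image
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class PredictionRequest(BaseModel):
    inputs: str

@app.get("/health")
def health() -> dict:
    # The endpoint infrastructure polls this route to decide the replica is healthy
    return {"status": "ok"}

@app.post("/predict")
def predict(request: PredictionRequest) -> dict:
    # Placeholder logic; replace with a real forward pass through your model
    return {"outputs": f"echo: {request.inputs}"}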

You can also take a look at my GitHub repo with a minimal working example: GitHub - sebastianschramm/fastapi_hf_endpoints: Custom fastapi server packaged as docker image for Huggingface inference endpoints deployment
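And a matching Dockerfile sketch that packages such a server; port 80 is an assumption here and has to match whatever container port you configure for the endpoint:

FROM python:3.10-slim

WORKDIR /app

# Install the server dependencies at build time
RUN pip install --no-cache-dir fastapi uvicorn

COPY app.py /app/

# Assumed port; must match the endpoint's container port setting
EXPOSE 80

# Start the REST API server so the endpoint can route requests to it
CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "80"]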