Multiple Requests to HuggingFace InferenceEndpoints are not working with custom Docker deployment. :-(

skumar1998 · March 26, 2024, 8:33pm

Hi coders,
I created a FastAPI server with several endpoints and deployed it on a Hugging Face Inference Endpoint using the custom Docker image configuration. The deployment went fine, and now I'm at the testing stage:

When I send requests one after another, everything works without any errors.
BUT when I send multiple requests at once using Python requests and concurrent.futures (multithreading), a few calls succeed but most of them fail with the error: 503 Service Unavailable.
BUT when I deploy the same custom code using a custom handler.py, it does not throw any errors. I tested that setup by sending 100 requests simultaneously and it worked like a charm. Of course, the requests had to wait their turn, but none of them failed.
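
For anyone who wants to reproduce this, here is a minimal sketch of the kind of concurrent test described above. It is not my exact script: the names run_load_test and make_sender, the worker count, the timeout, and the payload shape are all assumptions for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def run_load_test(send_one, total=100, workers=20):
    """Call send_one(i) for i in range(total) on a thread pool and tally results.

    send_one should return an HTTP status code. Exceptions are tallied under
    their class name, so connection errors show up next to 503 responses.
    """
    def safe_call(i):
        try:
            return send_one(i)
        except Exception as exc:  # record the failure instead of aborting the run
            return type(exc).__name__

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return Counter(pool.map(safe_call, range(total)))


def make_sender(url, payload):
    """Build a send_one function that POSTs payload to url (hypothetical helper)."""
    import requests  # imported lazily so run_load_test works without requests installed

    def send_one(i):
        # The 60 second timeout is an assumed value, not a recommendation.
        return requests.post(url, json=payload, timeout=60).status_code

    return send_one
```

Calling run_load_test(make_sender(url, payload)) with your own endpoint URL and request body returns a Counter keyed by status code, which makes it easy to see how many of the calls came back as 503 versus 200.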

Can anybody explain to me what’s wrong with the custom Docker image?

FYI: I tried a few things, but none of them worked.

Running the server with uvicorn and testing several concurrency settings; even that didn’t help.
Using the requests module to call the APIs/endpoints.
Creating multiple replicas seems to help, since far fewer requests fail in that case, but it still doesn’t work 100% of the time.

Thanks for taking the time to read this post. I truly appreciate any suggestions or recommendations. Thanks again! I’m hoping to hear back from someone.
