I am deploying models using Inference Endpoints and can successfully get responses from them. In the testing playground you can set a timeout and a threshold, and the ObjectDetectionPipeline likewise has threshold and timeout arguments. However, the predictions appear to be identical no matter what threshold I set.
I also could not find any documentation on how to pass a timeout or threshold in the cURL request:
curl "https://Something.us-east-1.aws.endpoints.huggingface.cloud" \
-X POST \
--data-binary '@cats.jpg' \
-H "Accept: application/json" \
-H "Authorization: Bearer hf_XXXXX" \
-H "Content-Type: image/jpeg" \
How do I control the prediction threshold via such a cURL request? And if it is always fixed, why does it only seem to return predictions with a score above 0.9?
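For reference, this is roughly what I would have expected to work: base64-encode the image and send it in a JSON body alongside a "parameters" object. I could not confirm from the documentation that the default object-detection handler actually accepts this payload shape, so treat it as a guess rather than a known-good request (the endpoint URL, token, and the 0.5 threshold are just placeholders). I also suspect that when no threshold is passed, the handler simply falls back to the pipeline's default of 0.9, which would explain what I am seeing.

# Hypothetical request: JSON body with a base64-encoded image and a parameters object
# (base64 -w0 is GNU coreutils; on macOS use: base64 -i cats.jpg)
curl "https://Something.us-east-1.aws.endpoints.huggingface.cloud" \
-X POST \
-H "Accept: application/json" \
-H "Authorization: Bearer hf_XXXXX" \
-H "Content-Type: application/json" \
-d "{\"inputs\": \"$(base64 -w0 cats.jpg)\", \"parameters\": {\"threshold\": 0.5}}"

Is something along these lines supported, or does the threshold have to be baked into a custom handler?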