Is it possible to run and stop an endpoint using code/API to avoid be billed when it is not used?

smartinezbragado · January 23, 2023, 9:27am

I want to use the HF inference endpoint for my project but, since the model will be used just few hours per day I want to launch the endpoint and stop it within the same day. Is it possible?

radames · January 23, 2023, 8:54pm

hi @smartinezbragado , currently it’s no possible, but it’s on a short term milestone to implement it.

ronvolutional · February 17, 2023, 3:47am

Happy to let you know we’ve just made this possible!: Pause and Resume your Endpoint
You can Pause/Resume as often as you’d like to only be billed while you need the model.

smartinezbragado · February 20, 2023, 10:50pm

Thanks @radames and @ronvolutional . I am reading the documentations of the API and I did not find anything to pause and resume the endpoint (only downscale it to 0). Is it possible to pause/resume it through API or only manually?

Thanks in advance

radames · February 22, 2023, 1:48am

just copying and pasting @philschmid response from discord here

curl --request PUT \
 --url https://api.endpoints.huggingface.cloud/endpoint/ENPOINT-NAME \
 --header 'Authorization: Bearer TOKEN' \
 --header 'Content-Type: application/json' \
 --data '{
 "compute": {
  "scaling": {
   "minReplica": 0,
   "maxReplica": 0
  }
 }
}'

mperesson · June 19, 2023, 12:25pm

This does not work anymore. I get a 400 error.

The dashboard get the exact same error when clicking on the “Stop endpoint” button.
I guess it’s linked to the new “Automatic Scale-to-Zero” option.

Any idea on how to pause/resume endpoints now?

mcpotato · June 19, 2023, 3:10pm

Hello, we’ve indeed changed the way you pause/stop an endpoint. There are two new routes, /pause and /resume.

You can check the swagger docs here, let me know if there is anything else I can do to help.

Topic		Replies	Views
Autoscaling is turned on to min replicas as 0. Yet costing money? Inference Endpoints on the Hub	2	508	August 11, 2023
Please help! Scaling up from 0 programmatically Beginners	1	241	June 3, 2024
Inference endpoint "failed" and then "deleted" Inference Endpoints on the Hub	1	409	March 8, 2024
Inference endpoints cost Beginners	1	540	November 1, 2022
Inference turned off for this model? Inference Endpoints on the Hub	1	1653	August 15, 2023

Is it possible to run and stop an endpoint using code/API to avoid be billed when it is not used?

Related topics