Autoscaling is set to min replicas 0, yet it is still costing money?

Kishal · August 11, 2023, 10:00am

I have set the endpoint to scale to zero after 15 minutes of inactivity, but it was still costing me money when there was no incoming traffic.

amitrethem · August 11, 2023, 11:17am

Hello, is there any way to raise this as a support request or defect? As per the HF documentation, we should not be charged if the inference endpoint is idle for more than 15 minutes. Does this not apply to both CPU- and GPU-based endpoints? Thanks

michellehbn · August 11, 2023, 4:54pm

Hi @Kishal, thanks for reaching out, and sorry to hear about this issue. Could you please send us an email at api-enterprise@huggingface.co? We’ll need the name of the endpoint in question and any other details you can share. We’ll take a look to see what happened. Thanks again!
