Pricing for Huggingface Endpoint

baniasbaabe · September 4, 2023, 9:18am

I am a bit confused about the pricing model. Let’s say I deploy a model on a CPU Basic machine ($0.06/hour). So do I pay as long as the model is deployed or do I pay only for the compute time (e.g. I make 2 requests and every request takes 10 seconds to run, so do I only pay for the 20 seconds). I ask because I want to do a hobby project but don’t want to pay a lot for it. Alternatives like cerebrium only offer 3 models for their starter version :')

philschmid · September 4, 2023, 11:12am

You can find the information in the documentation: Pricing

had123 · November 21, 2023, 11:20am

Hello,
I have the same question as @baniasbaabe, I don’t understand how you calculate the price. Can you please give us more clarity? If I subscribe for one month, will I have to pay for 730 hours or only the number of hours that I have interactions with the endpoint?

YKinebas · December 21, 2023, 8:24pm

From the link above:

At the end of the subscription period, the user or organization account will be charged for the compute resources used while Endpoints are initializing and in a running state.

So only compute time

DigitalMantis · March 10, 2024, 5:35pm

so does it cost less to keep a endpoint in a running state?

meganariley · March 22, 2024, 9:21pm

Hi @DigitalMantis In order to stop incurring cost on running endpoints, you’ll need to delete or pause them. We also have an automatic scaling to 0 feature!

You can check usage and billing at any time in your billing settings.

Hope this helps!

langhoangal · February 5, 2025, 12:34am

After deployment, I tested with 9 request within few minutes. And later (after few hours) I saw the compute time is 46 minutes. Trying another service for now.

Topic		Replies	Views
Misunderstanding about inference endpoint billing Beginners	2	776	February 5, 2025
How Huggingface pricing works for model Deployment? Inference Endpoints on the Hub	2	3283	October 20, 2023
How Can I Understand the Exact Cost of My Inference API Requests? Intermediate	2	133	April 16, 2025
Why it continuing add pricing Inference Endpoints on the Hub	4	70	October 24, 2024
Inference Providers: 3 cents per request? Beginners	4	335	March 12, 2025

Pricing for Huggingface Endpoint

Related topics