I just wanted to confirm that I am not charged anything beyond the monthly subscription for using it. This is not a dedicated inference API endpoint, just the normally available one.
There is no indication of this usage in the billing section. I just don’t want surprises.
I’m also interested in this, as I heavily rely on the Inference API (making 1 request per 10 seconds for 24 hours). I searched the documentation but couldn’t find relevant information.
For reference, here’s the code I use to send requests:
```python
from huggingface_hub import AsyncInferenceClient

client = AsyncInferenceClient("meta-llama/Meta-Llama-3-8B-Instruct")
chat = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hi, can I reach the moon by jumping?"},
]
# Top-level await assumes an async context (e.g. a Jupyter notebook or an async main()).
response = await client.chat_completion(chat, max_tokens=100, temperature=0.1)
```
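For completeness, the 1-request-per-10-seconds pacing I mentioned looks roughly like this. This is just a sketch: `dummy_query` is a hypothetical stand-in for the actual `chat_completion` call, and the interval accounting is my own, not anything from the Hugging Face docs:

```python
import asyncio
import time

async def rate_limited(coro_fn, interval_s: float, n_requests: int):
    """Call an async function every `interval_s` seconds, `n_requests` times."""
    results = []
    for _ in range(n_requests):
        start = time.monotonic()
        results.append(await coro_fn())
        # Sleep only for whatever remains of the interval after the request itself.
        elapsed = time.monotonic() - start
        await asyncio.sleep(max(0.0, interval_s - elapsed))
    return results

async def dummy_query():
    # Stand-in for an Inference API call such as client.chat_completion(...).
    return "ok"

results = asyncio.run(rate_limited(dummy_query, interval_s=0.01, n_requests=3))
print(results)
```

In the real loop I simply replace `dummy_query` with a coroutine that awaits `client.chat_completion(...)` and set `interval_s=10`.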