API inference limit changed?

Hi, I just noticed the start of “pay as you go” for PRO.

(a) Thanks for the quick implementation. The time to update the total due on the billing page is very fast, on par with OpenAI and faster than Google or Anthropic. Kudos.

(b) The cost looks pretty close to what I saw before, so less competitive than commercial offerings (5-8x), but still reasonable given that Hugging Face is non-profit and provides a high-quality environment for models that are otherwise hard to access.

(c) With this pricing, I can continue using my PRO subscription to access Llama-3.3-70B (or other open-source models), not as a primary summarization tool, but as a check when one of the hyperscalers whiffs on a new summary. Probably closer to what PRO is meant for ;->

(d) At some point, I will try an Inference Endpoint on Hugging Face to get a per-story cost for compute alone (after paying to spin up the instance).

(e) Thanks again!
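For point (d), the per-story compute cost on a dedicated endpoint is just the hourly instance rate divided by summarization throughput. A minimal sketch of that arithmetic, where the hourly rate and seconds-per-story figures are placeholder assumptions (not actual Hugging Face pricing):

```python
# Hypothetical back-of-the-envelope estimate of per-story cost on an
# always-on Inference Endpoint. The $/hour rate and seconds-per-story
# throughput are illustrative assumptions, not real pricing numbers.

def per_story_cost(hourly_rate_usd: float, seconds_per_story: float) -> float:
    """Cost per summarized story = hourly rate / stories processed per hour."""
    stories_per_hour = 3600.0 / seconds_per_story
    return hourly_rate_usd / stories_per_hour

# e.g. a $4/hour GPU instance that takes 20 seconds per summary
cost = per_story_cost(4.0, 20.0)
print(round(cost, 4))  # about $0.0222 per story
```

Note this ignores idle time: an endpoint billed while sitting idle between stories raises the effective per-story cost accordingly.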
