Hitting rate limits with Inference Providers

simon29 · September 9, 2025, 5:46pm

Hi

We are using HuggingFace Inference Providers (with Novita AI) and getting “code”:429, “reason”:“RATE_LIMIT_EXCEEDED” API responses.

We’re calling the API ~300 times per minute but would like to call it 400 times per minute. Is it possible to increase our rate limits?

Thanks

John6666 · September 9, 2025, 9:59pm

“reason”:“RATE_LIMIT_EXCEEDED” API responses.

Based on the error message, it seems to be getting caught on an error on Novita’s side. In this case, it looks like the only option is to consult Novita’s support…

To raise your rate limit, contact support or use a verified account.

simon29 · September 10, 2025, 7:54am

Thanks so much, I contacted them. It wasn’t clear though whether it was due to HuggingFace or Novita.

John6666 · September 10, 2025, 9:45am

There’s an Inference Providers channel on the HF Discord, so consulting there might be a good option. At the very least, your voice will reach the maintenance team.

simon29 · September 10, 2025, 3:14pm

Thanks @John6666 - You’re very helpful. I can’t manage to validate my account on Discord, it redirects me to the HF login page and says I’m not logged in (even though I am). I’ll send a direct message to lunarflu on Discord as you’ve advised in other threads.

Topic		Replies	Views
Need help for Infernece API rate limiting Beginners	0	324	May 26, 2024
Hugging Face API rate limits Beginners	15	17348	June 11, 2025
Facing Rate Limit issues on the inference API Beginners	1	5760	June 14, 2024
What are the Rate Limits For the Inference API Beginners	0	926	July 10, 2024
Rate Limit Reached without making calls? "Rate limit reached. You reached free usage limit (reset hourly)." 🤗Hub	1	2058	June 13, 2024

Hitting rate limits with Inference Providers

Related topics