Raise Inference Client GB Limit

Hello, is there by any chance a way to increase the 10 GB limit placed on the Inference Client for enterprise accounts on HF? Thanks.

Hi @ThatOneCoder, there is currently a 10 GB limit placed on the Inference Client.

The free Inference API is a solution for easily exploring and evaluating models, while Inference Endpoints is our paid inference solution for production use cases. For larger models, higher request volumes, or guaranteed latency/performance, we recommend using Inference Endpoints instead, which lets you deploy your models on dedicated, fully managed infrastructure.
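As a minimal sketch (not an official recipe): once a model is deployed as an Inference Endpoint, the same `InferenceClient` from `huggingface_hub` can be pointed at the endpoint URL instead of the serverless API. The endpoint URL and token below are placeholders for illustration, and the available task methods depend on the model you deploy.

```python
from huggingface_hub import InferenceClient

# Point the client at a dedicated Inference Endpoint instead of the
# serverless Inference API. URL and token are hypothetical placeholders.
client = InferenceClient(
    model="https://my-endpoint.endpoints.huggingface.cloud",
    token="hf_xxx",  # your Hugging Face access token
)

# Example call; works if the deployed model supports text generation.
output = client.text_generation("Deploying large models on dedicated infrastructure", max_new_tokens=20)
print(output)
```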

Further pricing information can be found here.

Okay, thanks.
