PRO plan for running huge models on the free Inference API?


I was curious whether the PRO plan would enable me to run the following request:

```
-d '{"inputs": "Can you please let us know more details about your "}'
-H "Authorization: Bearer xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"
```

As I am currently getting:

```
{"error":"The model HuggingFaceH4/starchat-alpha is too large to be loaded automatically (31GB > 10GB)…}
```

Jean Elbers

Hi @jelber2,

This is a really large model, so you may need dedicated hardware. I recommend looking at our Inference Endpoints - Hugging Face service and reaching out if you need help. Thanks!
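For reference, the curl call from the question can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the `hf_xxx` token is a placeholder, and for this particular model the free Inference API returns the "too large" error shown above rather than a generation.

```python
import json
import urllib.request

# Standard Inference API endpoint for the model named in the error message
API_URL = "https://api-inference.huggingface.co/models/HuggingFaceH4/starchat-alpha"

def query(payload: dict, token: str) -> dict:
    # Build a POST request equivalent to the curl flags in the question:
    # -d sends the JSON payload, -H sets the Authorization header
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

# Example usage (requires a real token; expect the "too large" error
# for this model unless it is deployed on dedicated hardware):
# query({"inputs": "Can you please let us know more details about your "}, "hf_xxx")
```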