How to use Inference API (serverless) in my model page?

LimYeri · June 3, 2024, 1:46pm

Hi All.
I have uploaded my fine-tuned model to Hugging Face. I want to create an inference API (serverless) on the model page, but a timeout occurs. What should I do?

Here is how I wrote the README:
pipeline_tag: text-generation
inference:
parameters:
max_new_tokens: 300
stop:
- <|end_of_text|>
- <|eot_id|>

kayrab · June 28, 2024, 5:53am

Screenshot from 2024-06-28 08-52-33
I am having a similar problem

Topic		Replies	Views
Model loading always times out? Beginners	0	189	August 19, 2024
Inference API timeout Site Feedback	0	187	May 29, 2024
Inference API time out problem...need help Beginners	3	342	February 28, 2024
Inference API time out? Site Feedback	2	903	February 28, 2024
Inference API time out Site Feedback	0	95	July 8, 2024

How to use Inference API (serverless) in my model page?

Related topics