Hi.
I’m a beginner at NLP.
I’m trying to summarize sentences with a T5 model through the Inference API.
The message “Model is currently loading” keeps popping up and the request does not proceed.
Can you tell me what this error means?
@Doogie
Hello
The Inference API loads models on demand, so if it’s your first time using a model in a while, it will be loaded first; you can then retry the request after a couple of seconds.
wait_for_model is documented in the link shared above.
If wait_for_model is false, you will get a 503 while the model is loading. If it’s true, your process will hang waiting for the response, which might take a while as the model loads. You can also pin models for instant loading (see Hugging Face – Pricing).
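For example, a request along these lines should wait instead of returning a 503 (a minimal sketch in Python using the requests library; the model id t5-small and the token value are placeholders, not something specific to your setup):

```python
import requests

# Hosted Inference API endpoint; t5-small is just an example model id.
API_URL = "https://api-inference.huggingface.co/models/t5-small"
headers = {"Authorization": "Bearer hf_xxx"}  # replace with your own API token

payload = {
    "inputs": "A long passage of text to summarize ...",
    # wait_for_model goes under "options"; with True, the request blocks
    # until the model has finished loading instead of returning a 503.
    "options": {"wait_for_model": True},
}

response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())
```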
I get a message that wait_for_model is no longer valid:
{'inputs': {'past_user_inputs': [], 'generated_responses': [], 'text': 'yo'}, 'parameters': {'min_length': 1, 'max_length': 500, 'repetition_penalty': 50.0, 'temperature': 50.0, 'use_cache': True, 'wait_for_model': True}}
{"error": "The following model_kwargs are not used by the model: ['wait_for_model'] (note: typos in the generate arguments will also show up in this list)"}
Hi. With “togethercomputer/GPT-NeoXT-Chat-Base-20B” I’m setting the “wait_for_model” parameter to true, but I still get the “Model is currently loading” error. Is it because the model is too big?
I’ve set the wait_for_model parameter to True in the payload in the same way as @deseipel, and it doesn’t work for me either. I don’t get a specific error about the request; I just get the usual 503 response: “Model is currently loading”.
It’s not that the ‘parameters’ dictionary needs to be renamed ‘options’; rather, ‘options’ is a separate top-level key alongside ‘parameters’, and it is the one that takes the ‘use_cache’ and ‘wait_for_model’ entries.
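So the payload from the earlier post would look something like this (a sketch reusing the values shown above; only the placement of use_cache and wait_for_model changes):

```python
payload = {
    "inputs": {
        "past_user_inputs": [],
        "generated_responses": [],
        "text": "yo",
    },
    "parameters": {
        "min_length": 1,
        "max_length": 500,
        "repetition_penalty": 50.0,
        "temperature": 50.0,
    },
    # use_cache and wait_for_model live under "options", not "parameters",
    # so they are no longer forwarded to generate() as unknown kwargs.
    "options": {"use_cache": True, "wait_for_model": True},
}
```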