We’ve pinned models (both via the API and via the dashboard at Dashboard - Hosted API - HuggingFace), but we still get “currently loading” errors when making Inference API calls.
One example: model https://huggingface.co/redwoodresearch/redwood_deberta-v3-sift_82b19d290a74410caa804fa47e94a80b (private but we can make it public if that would help). It’s currently (supposedly) pinned but still requires a minute of warmup after a period of inactivity.
Let me know if there’s anything we should do differently!
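For reference, here’s roughly what our call looks like (a minimal sketch; the token is a placeholder, and we’re using the documented `wait_for_model` option as a workaround, which blocks until the model is loaded instead of returning the 503 “currently loading” error — though with a pinned model we’d expect not to need it):

```python
import requests

# Model from the post above; token below is a placeholder.
API_URL = (
    "https://api-inference.huggingface.co/models/"
    "redwoodresearch/redwood_deberta-v3-sift_82b19d290a74410caa804fa47e94a80b"
)
HEADERS = {"Authorization": "Bearer hf_xxx"}  # placeholder token

def build_payload(text):
    # "wait_for_model": True tells the Inference API to hold the request
    # until the model is loaded, rather than failing with
    # {"error": "Model ... is currently loading"}.
    return {"inputs": text, "options": {"wait_for_model": True}}

# Actual call (commented out here so the sketch runs offline):
# resp = requests.post(API_URL, headers=HEADERS, json=build_payload("some input"))
# resp.raise_for_status()
# print(resp.json())
```

With `wait_for_model` the first request after idle time just takes the ~1 minute warmup instead of erroring, but that warmup is exactly what pinning is supposed to avoid.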
@Narsil Seems like you’ve worked on pinned models before - any chance you could take a look?
Were you able to solve this problem?
Hey, same problem here. The pinned model always has to load again after sitting idle for a while. Any solutions?
I have the same problem after pinning the model allenai/tk-instruct-11b-def; it never even loaded. It also doesn’t work from the model’s page on Hugging Face, which may suggest the problem is on Hugging Face’s side.
On the other hand, I tried pinning the smaller 3B version and it worked like a charm.
Pinning actually works. It’s just that this model is too big to be loaded by default.
What’s actually failing is the detection that this model is too big for the machines we’re using. To run these large models you need to discuss it with us first, since they require different hardware than the standard setup.
Hope that answers your questions.