Pinned model still needs to load

Connorvr · November 18, 2021, 1:57am

Hello,
I have a model pinned. After a short idle period, the Inference API still has to load the model, i.e. it returns the message ‘Model <username>/<model_name> is currently loading’. This is not supposed to happen, right? As I understand it, avoiding this is the whole purpose of pinning models.

I have confirmed that it is indeed pinned, using the following code:

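    import requests

    # Placeholder: substitute your own Hugging Face API token.
    HF_TOKEN = "<huggingface_token>"
    request_headers = {"Authorization": "Bearer {}".format(HF_TOKEN)}

    # Query the list of pinned models for this account.
    pin_url = "https://api-inference.huggingface.co/usage/pinned_models"
    response = requests.get(pin_url, headers=request_headers)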

The model is called through the following code:

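    import json

    # <username>/<model_name> is a placeholder for the pinned model's repo id.
    api_endpoint = 'https://api-inference.huggingface.co/models/<username>/<model_name>'
    data = json.dumps(payload)  # payload holds the model inputs
    response = requests.request('POST',
                                api_endpoint,
                                headers=request_headers,
                                data=data)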

I feel like I have followed everything in the documentation and don’t understand why it isn’t working.

Thank you in advance for any answers!

dmz · January 13, 2022, 8:28pm

We’re encountering the same issue.

abhig · September 12, 2022, 10:51pm

Still not solved.
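
Until the cause is found, one client-side workaround is to retry while the API reports that the model is loading. The following is only a minimal sketch, not a fix for the pinning problem itself. It assumes the loading response arrives with HTTP status 503 and a JSON body whose `error` field contains "currently loading", optionally with an `estimated_time` field in seconds; those details are assumptions and should be checked against what your endpoint actually returns. The helper name `call_with_loading_retry` and its `send` parameter are hypothetical.

```python
import time
from typing import Any, Callable, Tuple


def is_loading(status: int, body: Any) -> bool:
    """Return True if the response looks like a 'model is loading' reply.

    Assumption: the API answers with HTTP 503 and a JSON object whose
    'error' text contains 'currently loading'.
    """
    return (
        status == 503
        and isinstance(body, dict)
        and "currently loading" in str(body.get("error", ""))
    )


def call_with_loading_retry(
    send: Callable[[], Tuple[int, Any]],
    max_attempts: int = 5,
    default_wait: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, Any]:
    """Call send() until it stops reporting a loading model.

    send() must return (status_code, parsed_json_body). The last response
    is returned as-is once max_attempts is reached.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if not is_loading(status, body):
            return status, body
        if attempt < max_attempts - 1:
            # Assumption: 'estimated_time' (seconds) may be present in the body.
            sleep(float(body.get("estimated_time", default_wait)))
    return status, body
```

With the variables from the original post, `send` could be a small function that calls `requests.post(api_endpoint, headers=request_headers, data=data)` and returns `(r.status_code, r.json())`. Passing `send` and `sleep` as parameters keeps the retry logic testable without touching the network.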
