Cannot Deploy My Private Model for Inference Endpoint

Hi,

I have a private model that I’d like to deploy to an Inference Endpoint, but I cannot find it in the model search box. Is it impossible to deploy a private model, or am I missing something?

I already turned on inference in the model metadata:

license: creativeml-openrail-m
language:
  - en
pipeline_tag: text-generation
tags:
  - endpoints
  - text-generation-inference
inference: true

Any suggestions will be appreciated. Thanks.

Hello @weiqis,

Private models can be deployed in the account that “owns” the repository, meaning that if you use an organization on Inference Endpoints, the model needs to be part of that organization.

Also worth mentioning: we had a bug until this afternoon where private repos weren’t showing up anymore, but that’s fixed now.

Hi @philschmid, thanks for your quick reply. But the model is under my own account, and I’m not currently in any organization. My Inference Endpoint is also under my own account, so I have full ownership of the private model. I just checked, and I’m still not able to see my models.

Can you share a screenshot of it?

Absolutely. Here is my model (I don’t have any information other than the metadata in my model card):

[screenshot of the model page]

And here is the Inference Endpoints page where I tried to search for my model:

[screenshot of the endpoint model search]

Hi @weiqis ! The issue should now be fixed. Please tell us if you are now able to import your private models.

Best,
Thibault

Yep, I can see all my private models now. Thanks!