Model works in inference UI, but not on inference API

andrazz · August 9, 2023, 10:11am

Hi,

I’m trying to deploy my model to an inference endpoint on AWS. This model works both through the Huggingface UI/API and locally via diffusers library, both with accelerate and without, but when trying to deploy it as an inference endpoint I’m getting this weird output: If the logs weren’t truncated to one line I’d be able to tell what are the missing keys, but this way the information is very sparse. Anyone has some idea what could be wrong?

7lprw 2023-08-08T09:30:54.564Z ValueError: Cannot load <class 'diffusers.models.autoencoder_kl.AutoencoderKL'> from /repository/vae because the following keys are missing: 
7lprw 2023-08-08T09:30:54.564Z   File "/app/huggingface_inference_toolkit/handler.py", line 17, in __init__
7lprw 2023-08-08T09:30:54.564Z   File "/opt/conda/lib/python3.9/site-packages/starlette/routing.py", line 654, in startup
7lprw 2023-08-08T09:30:54.565Z Application startup failed. Exiting.

Here’s the model in question, as you can see it works normally through the web UI. 8clabs/sketch-model-2 · Hugging Face I created this model by importing a safetensors exported from ComfyUI and then exporting it via diffusers .

Thanks in advance!

Topic		Replies	Views
Key Error when trying to deploy inference endpoint Inference Endpoints on the Hub	2	791	December 3, 2023
Unable to access model with Inference Client Beginners	2	167	May 6, 2025
HuggingFace Endpoint Error on AWS Inference Endpoints on the Hub	2	56	September 1, 2024
HF Inference Endpoints don't finish Initializing Inference Endpoints on the Hub	0	239	March 28, 2024
Inference API stopped working for my model 🤗Hub	11	5376	April 26, 2023

Model works in inference UI, but not on inference API

Related topics