Can't run Janus with HuggingFaceEndpoint

I’m trying to integrate a Hugging Face endpoint into my LangChain project using the HuggingFaceEndpoint class provided by LangChain.

from langchain_huggingface import HuggingFaceEndpoint

llm = HuggingFaceEndpoint(
    repo_id="deepseek-ai/Janus-Pro-7B",
    task="image-text-to-text",
    max_new_tokens=512,
    do_sample=False,
    repetition_penalty=1.03,
)

But the call fails with a 500 Server Error and the message unknown variant ‘any-to-any’. I’ve tried both task="image-text-to-text" and task="image-to-text", and they return the same error.
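The ‘any-to-any’ in the error comes from the model’s declared pipeline_tag on the Hub, not from the task= argument you pass: Janus-Pro-7B is tagged any-to-any, which the serverless Inference API does not serve. A minimal sketch of that check (the supported-task set below is an assumption for illustration, not the API’s real allowlist):

```python
# Sketch: the server rejects a model based on its Hub pipeline_tag,
# regardless of the task= you pass to HuggingFaceEndpoint.
# SERVED_VISION_TASKS is an assumption for illustration only.
SERVED_VISION_TASKS = {"image-text-to-text", "image-to-text"}

def is_servable(pipeline_tag: str) -> bool:
    """Return True if this Hub tag is one the Inference API can serve."""
    return pipeline_tag in SERVED_VISION_TASKS

# Janus-Pro-7B is tagged "any-to-any" on the Hub, hence the rejection:
print(is_servable("any-to-any"))          # False
print(is_servable("image-text-to-text"))  # True

# To fetch a model's real tag (requires huggingface_hub and network access):
# from huggingface_hub import model_info
# print(model_info("deepseek-ai/Janus-Pro-7B").pipeline_tag)
```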


It’s not only Janus-Pro-7B that fails: other VLMs such as Qwen2.5-VL-7B-Instruct and glm-4v-9b can’t run either. I suspect HuggingFaceEndpoint cannot run VLMs at all.


Oh…

This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support any-to-any models for transformers library.

Thanks! Which VLMs can I use with the HF Inference API?



This? Qwen/Qwen2-VL-7B-Instruct
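For a VLM served through the router’s chat-completions endpoint, the request uses OpenAI-style multimodal messages. A sketch of building that payload (the image URL and question are placeholders; the message shape follows the /v1/chat/completions format):

```python
def build_vlm_messages(image_url: str, question: str) -> list:
    """Build an OpenAI-style multimodal message list for /v1/chat/completions."""
    return [{
        "role": "user",
        "content": [
            # The image part points at a URL the endpoint can fetch.
            {"type": "image_url", "image_url": {"url": image_url}},
            # The text part carries the actual question about the image.
            {"type": "text", "text": question},
        ],
    }]

messages = build_vlm_messages("https://example.com/cat.png",
                              "What is in this image?")

# Sending it (requires huggingface_hub, network access, and an HF token):
# from huggingface_hub import InferenceClient
# client = InferenceClient(model="Qwen/Qwen2-VL-7B-Instruct")
# out = client.chat_completion(messages=messages, max_tokens=128)
```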

Sorry! This model returns huggingface_hub.errors.HfHubHTTPError: 503 Server Error: Service Temporarily Unavailable for url: https://router.huggingface.co/hf-inference/models/Qwen/Qwen2.5-VL-7B-Instruct/v1/chat/completions
And I can rule out a network problem, because microsoft/Phi-3-mini-4k-instruct runs fine.
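Since the network clearly works, a 503 from the router usually means the model is cold-starting or not currently deployed on any provider. In the cold-start case, retrying with backoff can help. A hedged sketch (it matches the error by its message text for simplicity; in practice you would catch HfHubHTTPError and inspect the status code):

```python
import time

def call_with_retry(fn, retries=3, base_delay=2.0):
    """Retry fn on 503-style errors with exponential backoff.

    This only helps if the model is cold-starting; a model that is not
    deployed on any provider will keep returning 503 no matter what.
    """
    for attempt in range(retries):
        try:
            return fn()
        except Exception as err:  # in practice: huggingface_hub.errors.HfHubHTTPError
            # Re-raise anything that isn't a 503, and give up on the last try.
            if "503" not in str(err) or attempt == retries - 1:
                raise
            time.sleep(base_delay * (2 ** attempt))

# Usage (hypothetical, assuming llm is a working HuggingFaceEndpoint):
# result = call_with_retry(lambda: llm.invoke("Describe the image ..."))
```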
