Deploying models onto spaces

TS0001 · May 29, 2023, 5:28pm

I have tried to deploy many of the models on the LLM leaderboard (Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4) onto a Space as a Gradio app. The deployment looks OK, but none of the models I have tried return a response; they just time out.

I have a paid account and am selecting an A10G as the hardware.

I am sure I am making a rookie mistake; any help would be much appreciated.

tmarkov · August 1, 2023, 11:43pm

Bump, because I’m having the exact same problem.

I’ve tried to deploy a 7B model (Ejafa/vicuna_7B_vanilla_1.1 · Hugging Face) onto T4 medium hardware, and it won’t produce a response to even simple prompts. It stays stuck on “processing” for minutes.

The code I used to deploy is the one suggested by the HF interface:

import gradio as gr

# The "models/" prefix builds the demo from the model's Hub repo via the hosted Inference API.
gr.Interface.load("models/Ejafa/vicuna_7B_vanilla_1.1").launch()
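
One possible explanation, offered as a hedged note rather than a confirmed diagnosis: the `models/` prefix in the snippet above makes Gradio build the demo on top of the hosted Inference API, so generation does not run on the Space's own GPU, and large models may not respond in time. A minimal sketch of the alternative, loading the weights inside the Space with the `transformers` text-generation pipeline, could look like the following. The helper names (`get_generator`, `respond`, `respond_ui`, `build_demo`), the `max_new_tokens` value, and the use of float16 with `device_map="auto"` (which needs the `accelerate` package) are assumptions for illustration, not something this thread confirms.

```python
from functools import lru_cache

MODEL_ID = "Ejafa/vicuna_7B_vanilla_1.1"  # model named in the post above


@lru_cache(maxsize=1)
def get_generator():
    """Load the model once, on the Space's own hardware (hypothetical helper)."""
    # Imported lazily so the heavy libraries load only when a model is needed.
    import torch
    from transformers import pipeline

    return pipeline(
        "text-generation",
        model=MODEL_ID,
        torch_dtype=torch.float16,  # assumption: half precision to fit the GPU
        device_map="auto",          # assumption: requires the accelerate package
    )


def respond(prompt, generator=None, max_new_tokens=128):
    """Generate a completion and return it without the echoed prompt."""
    if generator is None:
        generator = get_generator()
    outputs = generator(prompt, max_new_tokens=max_new_tokens, do_sample=False)
    text = outputs[0]["generated_text"]
    # The pipeline returns prompt + completion by default; drop the prompt.
    if text.startswith(prompt):
        text = text[len(prompt):]
    return text.strip()


def respond_ui(prompt):
    """Single-argument wrapper so the Gradio input maps onto one parameter."""
    return respond(prompt)


def build_demo():
    """Build the Gradio interface without launching it."""
    import gradio as gr

    return gr.Interface(
        fn=respond_ui,
        inputs=gr.Textbox(label="Prompt"),
        outputs=gr.Textbox(label="Response"),
    )
```

In a Space, `app.py` would end with `build_demo().queue().launch()`, and `requirements.txt` would need to list `transformers`, `torch`, and `accelerate`. Whether a given model then fits on the selected hardware still depends on its size and precision.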

TS0001 · August 1, 2023, 11:57pm

I gave up trying to deploy on HF in the end, and switched to Runpod. This tutorial was my starting point.
