Deploying my own custom Llama model to production using Hugging Face

Hello everyone,

Our university project involves deploying a custom fine-tuned Llama model for our institute's chatbot. I'm new to Hugging Face and would appreciate guidance on deploying the model. We've already prepared the Chat UI and the model itself.

We want the chatbot to be usable by everyone, so of course we plan to host the model on a cloud service and run inference through an API. Scalability is crucial: we need to handle multiple queries simultaneously, and the model should support large context lengths in the prompt. Which Hugging Face service would be best in terms of both functionality and cost-effectiveness?
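For context, here is a rough sketch of how our Chat UI would call the deployed model over HTTP, assuming a Text Generation Inference style endpoint (the endpoint URL and token below are placeholders, not real values):

```python
import json
import urllib.request

# Placeholders: replace with the real endpoint URL and access token after deployment.
ENDPOINT_URL = "https://YOUR-ENDPOINT.endpoints.huggingface.cloud"
HF_TOKEN = "hf_xxx"

def build_payload(prompt: str, max_new_tokens: int = 512) -> dict:
    """Build a request body in the shape TGI's /generate route expects."""
    return {"inputs": prompt, "parameters": {"max_new_tokens": max_new_tokens}}

def generate(prompt: str, max_new_tokens: int = 512) -> str:
    """POST the prompt to the endpoint and return the generated text."""
    req = urllib.request.Request(
        f"{ENDPOINT_URL}/generate",
        data=json.dumps(build_payload(prompt, max_new_tokens)).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {HF_TOKEN}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["generated_text"]
```

Is this roughly the right integration pattern, or would a managed client (e.g. `huggingface_hub`'s `InferenceClient`) be the recommended route instead?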