How to deploy model on custom server?

arvind12 · February 20, 2024, 8:56am

Hi! I have finetuned a wav2vec2 on custom data for ASR. How can i deploy it on my own GPU server? what are the possible way to make our own server because cloud is very costly and I cannot afford it. I want to deploy it on my own GPU and want to give my customer an API for using it. how can i scale it to the 1000 of user?
If I deploy the model on my own server, do I need to create 1000 instances of the same model for 1000 customers to use it simultaneously?

Topic		Replies	Views
Model Deploy On-prem Beginners	1	814	March 21, 2024
Deploy multilingual sentence tansformer into cloud Beginners	10	2711	July 16, 2021
Deploying inference model size and performance 🤗Transformers	6	5222	July 9, 2024
Deploying my own custom Llama model to production using Hugging Face Beginners	0	828	December 9, 2023
What is best way to serve huggingface model with API? Beginners	11	42844	August 29, 2023

How to deploy model on custom server?

Related topics