Calculate costs for multiple models in same machine

micuentadecasa · September 5, 2023, 3:01pm

Hi, I need to deploy multiple models (image recognition, llama2, video transcript, etc), and I’m trying to find the cost for this. I have found the pricing of the different servers, but I don’t know how to calculate how many models can I run in the same machine (if it is possible to share a machine with different endpoints), for example, let’s say that I want to use Idefics model, how can I know the machine that I need? how much percentage of the machine is using? etc.
I will appreciate if somebody has some information about this topic.
Regards.

Topic		Replies	Views
Pricing for Huggingface Endpoint Inference Endpoints on the Hub	6	3320	February 5, 2025
About the Inference Endpoints on the Hub category Inference Endpoints on the Hub	3	1652	May 8, 2025
How Can I Understand the Exact Cost of My Inference API Requests? Intermediate	2	137	April 16, 2025
Huggingface hosting cost calculation 🤗Transformers	2	869	September 12, 2023
Misunderstanding about inference endpoint billing Beginners	2	777	February 5, 2025

Calculate costs for multiple models in same machine

Related topics