Inference Endpoints - Best thing since sliced bread?

awacke1 · September 13, 2023, 2:13pm

I want to pass along my gratitude and appreciation for Inference Endpoints, which is one of the most useful features available anywhere in ML today. I started a test of small Whisper and Llama models running on a T4 and an A10 respectively, which seems a perfect cost/benefit fit for those two models in an end-to-end speech-to-text-to-LLM-to-speech pipeline that lets you speak to an LLM. Thanks and kudos to the team!!!

Demo Space with the full AI pipeline: 🐪DromeLlama🦙 Chat WhisperLangchain 🌟FAISS Embeddings - a Hugging Face Space by awacke1

Hatman · September 20, 2023, 1:09pm

They are pretty great for workloads not in production. Kind of like beads thrown out at Mardi Gras. Sometimes it's the best, and other times it's the gutter.
