Running inference via the API with an integrated language model (LM)

I followed @patrickvonplaten's article on building an n-gram language model from a dataset, and I successfully ran inference with it locally. However, inference on my local CPU device was too slow, so I pushed my model to the Hub as my-model and decided to run inference via the API, like this:

```python
import json

import requests

API_TOKEN = "hf_xxx"  # placeholder: your Hugging Face access token
API_URL = "https://api-inference.huggingface.co/models/ridhoalattqas/xlrs-best-lm"
headers = {"Authorization": f"Bearer {API_TOKEN}"}

def query(audio_bytes):
    response = requests.post(API_URL, headers=headers, data=audio_bytes)
    return json.loads(response.content.decode("utf-8"))
```
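For completeness, here is roughly how I call it end-to-end (the file path and retry count are placeholders). As far as I understand, while the model is still loading the API answers with an `"error"` JSON that includes an `"estimated_time"`, so I wait and retry instead of failing immediately:

```python
import json
import time

import requests

API_TOKEN = "hf_xxx"  # placeholder: your Hugging Face access token
API_URL = "https://api-inference.huggingface.co/models/ridhoalattqas/xlrs-best-lm"
headers = {"Authorization": f"Bearer {API_TOKEN}"}

def parse_response(content: bytes) -> dict:
    # The API answers with JSON: {"text": "..."} on success, {"error": "..."} otherwise
    return json.loads(content.decode("utf-8"))

def transcribe(path: str, retries: int = 3) -> dict:
    with open(path, "rb") as f:
        audio_bytes = f.read()
    result = {}
    for _ in range(retries):
        response = requests.post(API_URL, headers=headers, data=audio_bytes)
        result = parse_response(response.content)
        # While the model is cold, the response carries "estimated_time";
        # sleep that long and try again
        if "error" in result and "estimated_time" in result:
            time.sleep(result["estimated_time"])
            continue
        return result
    return result
```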

But I am facing another problem: the language model that I pushed to my-model does not seem to be linked (used) when I run inference through the API.
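One thing I am checking (a sketch, not a definitive fix): for the LM-boosted decoder to be picked up, the repo needs the files from the n-gram article — `alphabet.json` at the root plus the n-gram files under a `language_model/` folder. The Hub's model-info endpoint lists a repo's files, so I can verify they were actually pushed (`has_lm_files` is just a helper name I made up):

```python
import requests

def has_lm_files(filenames):
    # Wav2Vec2ProcessorWithLM expects alphabet.json at the repo root
    # and the n-gram files under language_model/ (layout from the n-gram article)
    has_lm_dir = any(name.startswith("language_model/") for name in filenames)
    return has_lm_dir and "alphabet.json" in filenames

def repo_files(repo_id):
    # The Hub lists a repo's files under "siblings" in its model-info endpoint
    info = requests.get(f"https://huggingface.co/api/models/{repo_id}").json()
    return [sibling["rfilename"] for sibling in info["siblings"]]
```

Usage: `has_lm_files(repo_files("ridhoalattqas/xlrs-best-lm"))` should return `True` if the LM files made it to the Hub.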

Does anyone have suggestions for making this API inference work with the LM? I plan to subscribe to a paid tier for higher rate limits if it works.

Thank you