I don't have a GPU on my PC, so I want to call an API instead, just like OpenAI, Cohere, etc.
I'm looking for a Llama 3 API.
Not sure, but you can check this. It provides free API inference: https://console.groq.com/playground
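Groq's endpoint is OpenAI-compatible, so you can point the regular openai Python client at it. A minimal sketch; the base URL and the llama3-8b-8192 model ID are taken from Groq's docs at the time of writing and may change:

# Sketch: calling Llama 3 through Groq's OpenAI-compatible endpoint.
# Assumptions: base_url and the model ID "llama3-8b-8192" follow Groq's
# documentation and may change over time.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key="<your-groq-api-key>",  # create one at console.groq.com
)

response = client.chat.completions.create(
    model="llama3-8b-8192",
    messages=[{"role": "user", "content": "Count to 10"}],
)
print(response.choices[0].message.content)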
If you're looking for something free, this Space might be useful.
On a $10/month subscription, you could also use the 70B Llama 3 model, for example. (I use it.)
Like NatureSon said, I use the Groq API playground.
Hi,
HF provides the serverless Inference API to do just that. It comes with OpenAI-compatible APIs.
Usage is as follows (add your HF token):
# instead of `from openai import OpenAI`
from huggingface_hub import InferenceClient

# instead of `client = OpenAI(...)`
client = InferenceClient(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    token="<your-hf-token>",
)

output = client.chat.completions.create(
    model="meta-llama/Meta-Llama-3-8B-Instruct",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Count to 10"},
    ],
    stream=True,
    max_tokens=1024,
)

# With stream=True, tokens arrive as chunks; the last chunk's delta can be empty.
for chunk in output:
    print(chunk.choices[0].delta.content or "", end="")
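Since the API is OpenAI-compatible, you can also keep the openai client itself and only swap the base URL. A sketch assuming the serverless base URL from HF's docs:

# Same request through the openai package, pointed at HF's serverless endpoint.
# Assumption: the base_url below matches HF's documented OpenAI-compatible route.
from openai import OpenAI

client = OpenAI(
    base_url="https://api-inference.huggingface.co/v1/",
    api_key="<your-hf-token>",
)

response = client.chat.completions.create(
    model="meta-llama/Meta-Llama-3-8B-Instruct",
    messages=[{"role": "user", "content": "Count to 10"}],
    max_tokens=1024,
)
print(response.choices[0].message.content)

This way existing OpenAI-based code only needs the base URL and API key changed.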
With a PRO subscription, you get higher rate limits.