Inference Issue with Llama Models using HF Inference

When loading a fine-tuned model that is published as delta (adapter) weights, an error can occur because the library also has to fetch the original base model, and Llama base models on the Hugging Face Hub are gated. In short, the fix is to authenticate with a Hugging Face access token. There are several ways to do this, such as passing it directly with token= to from_pretrained(), or calling login() from huggingface_hub in advance.
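A minimal sketch of the two workarounds. The model name "meta-llama/Llama-2-7b-hf" is illustrative, and the token is assumed to be stored in the HF_TOKEN environment variable; substitute your own gated base model and token source.

```python
# Sketch: authenticating so a gated Llama base model can be downloaded.
# Assumptions: BASE_MODEL is a hypothetical gated repo you have access to,
# and HF_TOKEN holds a valid Hugging Face access token.
import os

from transformers import AutoModelForCausalLM, AutoTokenizer

BASE_MODEL = "meta-llama/Llama-2-7b-hf"  # illustrative gated base model


def load_with_token_kwarg():
    """Option 1: pass the token directly to from_pretrained()."""
    token = os.environ["HF_TOKEN"]
    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL, token=token)
    model = AutoModelForCausalLM.from_pretrained(BASE_MODEL, token=token)
    return tokenizer, model


def load_after_login():
    """Option 2: authenticate once up front, then load normally."""
    from huggingface_hub import login

    login(token=os.environ["HF_TOKEN"])  # caches the token for later calls
    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL)
    model = AutoModelForCausalLM.from_pretrained(BASE_MODEL)
    return tokenizer, model
```

Either approach resolves the authentication error; login() is convenient when several downloads follow, while token= keeps the credential scoped to a single call.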