Need help as I am trying for deployment but not luck

wipro-gcp-ashu · July 9, 2025, 3:31pm

Hello Hugging Face Support,

We are deploying vLLM in Kubernetes and trying to access the gated model mistralai/Mistral-7B-Instruct-v0.1.

Our account is granted access (the model page confirms this).
We created a fine-grained token with “Read access to contents of all public gated repos you can access”.
The token works perfectly with the Python client and CLI on a VM.
The token is correctly injected into our pod as HUGGINGFACE_HUB_TOKEN (we verified this).
But inside the pod, the model download fails with a 401 Unauthorized error.

We have restarted pods, updated secrets, and confirmed the environment variable is correct.

This appears to be a backend issue with token authentication for gated models in Kubernetes.

Could you please investigate or advise?

Thank you,
Ashutosh Kumar (wipro-gcp-ashu)

John6666 · July 9, 2025, 11:00pm

HUGGINGFACE_HUB_TOKEN

There are several possible causes, but I think this is the most likely one. Currently, it is common to use HF_TOKEN.

Topic		Replies	Views
Token not working Beginners	1	57	June 19, 2025
401 Error when trying to upload model from local machine Beginners	3	99	June 17, 2025
Invalid token or no access to Hugging Face Beginners	3	1937	May 29, 2025
Access issues for gated repos Beginners	3	4001	August 23, 2024
Acces token dont work Beginners	3	459	January 6, 2025