Hello Hugging Face Support,
We are deploying vLLM in Kubernetes and trying to access the gated model mistralai/Mistral-7B-Instruct-v0.1.
- Our account is granted access (the model page confirms this).
- We created a fine-grained token with “Read access to contents of all public gated repos you can access”.
- The token works perfectly with the Python client and CLI on a VM.
- The token is correctly injected into our pod as HUGGINGFACE_HUB_TOKEN (we verified this).
- But inside the pod, the model download fails with a 401 Unauthorized error.
We have restarted pods, updated secrets, and confirmed the environment variable is correct.
This appears to be a backend issue with token authentication for gated models in Kubernetes.
Could you please investigate or advise?
Thank you,
Ashutosh Kumar (wipro-gcp-ashu)