Authorization header in inference API

cahya · December 23, 2020, 7:13am

Hi,
I am just wondering what is the purpose of the “Authorization” http header in the inference API. If I remove this header, the request is still working. Example:

curl 'https://api-inference.huggingface.co/models/bert-base-uncased' \
  -H 'Connection: keep-alive' \
  -H 'Content-Type: text/plain;charset=UTF-8' \
  -H 'Accept: */*' \
  -H 'Accept-Language: en,en-US;q=0.9,id;q=0.8,de;q=0.7,ms;q=0.6' \
  --data-binary '{"inputs":"If I am hungry, I will make [MASK]."}' \
  --compressed

julien-c · December 23, 2020, 10:39am

@Narsil and @jeffboudier can chime in, but we offer a certain number of non-authed requests, with IP-based rate limiting.

For production workloads you’ll need a token.

jeffboudier · December 23, 2020, 6:51pm

Hi Cahya! The main purpose of the Authorization http header is to authenticate commercial customers of our Hosted Inference API subscriptions, for production workloads that require models to be preloaded / always available, to enable accelerated inference on CPU and/or GPU, and access to private models. If this service is of interest for your organization, please reach out!

cahya · December 23, 2020, 7:05pm

Thanks @jeffboudier and @julien-c for the explanation.

Topic		Replies	Views
What headers are accepted by Inference API? Beginners	1	417	May 2, 2021
Hosted Inference API Beginners	0	844	March 6, 2023
{'error': 'Authorization header is correct, but the token seems invalid'} Beginners	0	29	August 8, 2024
Authorization header is correct, but the token seems invalid 🤗Tokenizers	3	165	October 10, 2024
Inference endpoint data privacy Inference Endpoints on the Hub	5	3599	April 17, 2023

Authorization header in inference API

Related topics