I am in the process of creating an inference endpoint of a gated model.
I went through the documentation but could not find a secure way of passing an authentication token to the custom handler.
It seems that secrets and environment variables are only available in spaces, but not in inference endpoints.
Here is a link to my custom handler and to the corresponding space that works fine thanks to a secret.
I could hard code the token and make the model private to avoid leaking the token but that does not seem right.
Any suggestion?