Inference endpoint data privacy

Ok. It seems my assumption was wrong. From what I can tell, using a paid Inference Endpoint involves paying to instantiate a compute instance and keep it running continuously. I was hoping for something more “on demand,” but I suppose that makes sense. I think the Inference API is a better fit for the early stages of my project.

Phil (or someone else who knows), would you mind explaining a little more about whether, and how, payloads and tokens are stored and accessed on the Inference API? My goal is to use my chosen model on proprietary projects, so privacy and security are important to me. I can’t find this information in the documentation. Also, what are the usage/rate limits?
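For reference, my calls look roughly like this (a minimal sketch; the model ID, the prompt text, and the `HF_API_TOKEN` environment variable name are all placeholders). The payload and token here are exactly the pieces I’m asking about:

```python
import os
import requests

# Placeholder model ID -- substitute the actual model.
API_URL = "https://api-inference.huggingface.co/models/MODEL_ID"

# Token read from an environment variable rather than hard-coded.
headers = {"Authorization": f"Bearer {os.environ['HF_API_TOKEN']}"}

# The request payload: does this get stored or logged server-side?
payload = {"inputs": "Example prompt describing a proprietary design..."}

response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())
```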

Sorry for all the questions, and thanks in advance for your support!
