I’m trying to understand how organization billing works. First some context. We are using an open source model and do not host anything in the organization. Using an API and pay for usage seems ideal. A credit card was added to the organization but not the user accounts.
There are two usage patterns:
- Low volume requests from a developer.
- High volume requests from a batch job. This is in the range of 10’s of thousands of requests.
The Inference API is the first in the list when deploying a model. The pricing page does not mention anything about the Inference API. This could be because it’s advertised as “free”. Is there a paid tier? After much digging, I found the Inference for PROs blog where it mentions increased rate limits and a few other benefits over the “free” tier. Can an organization pay for a user’s PRO account? How does that work? Note: The pricing page does not mention multiple Inference API tiers. Also, the Accelerated Inference API does not mention the PRO version. This is very confusing!
The Inference API section about Parallelism and batch jobs mentions Spaces, so let’s talk about Spaces. The Spaces overview and Spaces page imply Spaces is meant for demos and portfolio apps. This goes against the implication that Spaces can be used by batch jobs. If Spaces is suitable for batch jobs, how does the billing work? Org API keys are deprecated. Will Spaces know to use an organization billing if the user billing is empty? How does that work?
The Spaces GPU Upgrades documentation has a section on billing but it doesn’t state anything about organization billing. The Billing documentation hints that organization billing is only for Enterprise Hub subscriptions since “PRO subscription” is “for users” and “Enterprise Hub subscriptions” are “for organizations”. If I’m only interested in using the API (to access hosted models), is it necessary to subscribe to Enterprise Hub for the sole purpose of moving the billing from the user to the organization?
I haven’t even gotten to Inference Endpoints which requires managing IAM permission for EC2 instances created by Hugging Face. I don’t want to deal with the EC2 permissions, and if I can use an API to access a hosted model AND have the organization pay for usage, that would be ideal. Any insight you can provide is greatly appreciated.