The current limits are listed below. Note that the models usable with the free or $9/month Serverless Inference API are effectively limited to those marked as Warm.
The Endpoint API has no such restrictions, but it is billed on a pay-as-you-go basis.
User tier | Rate limit |
---|---|
Signed-up Users | 1,000 requests per day |
PRO and Enterprise Users | 20,000 requests per day |
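
If you want to avoid running into these limits unexpectedly, one option is to count requests on the client side. The following is only a minimal sketch: the `DailyQuota` class and the tier keys in `DAILY_LIMITS` are hypothetical names made up for illustration, not part of any Hugging Face library, and it assumes the quota resets once per calendar day, which may not match how the service actually counts requests.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import Optional

# Daily limits copied from the table above; the tier keys are hypothetical labels.
DAILY_LIMITS = {
    "signed_up": 1_000,
    "pro_or_enterprise": 20_000,
}


@dataclass
class DailyQuota:
    """Client-side request counter (assumption: the quota resets each calendar day)."""

    limit: int
    used: int = 0
    day: date = field(default_factory=date.today)

    def try_acquire(self, today: Optional[date] = None) -> bool:
        """Record one request and return True, or return False if today's budget is spent."""
        today = today or date.today()
        if today != self.day:
            # A new day has started: reset the local counter.
            self.day = today
            self.used = 0
        if self.used >= self.limit:
            return False
        self.used += 1
        return True

    @property
    def remaining(self) -> int:
        """Requests left for the day currently being tracked."""
        return self.limit - self.used
```

Call `try_acquire()` before each API request and skip or postpone the request when it returns `False`. The counter only tracks what this one process has sent, so the server remains the authority on whether a request is actually accepted.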