Which plan i use

The current limits are as follows, and please note that the models that can be used with the free or $9/month Serverless Inference API are effectively limited to those marked as Warm.
There are no restrictions on the Endpoint API, which is charged on a pay-as-you-go basis, but it is charged on a pay-as-you-go basis.

Signed-up Users 1,000 requests per day
PRO and Enterprise Users 20,000 requests per day