Getting 429 Error for sentence-transformers/all-mpnet-base-v2

Hi Everyone,

I ran into this rate-limit error for the first time when calling a sentence-transformers model:
429 Client Error: Too Many Requests for url: https://api-inference.huggingface.co/pipeline/feature-extraction/sentence-transformers/all-mpnet-base-v2

Has anyone else faced this? How can we avoid this error?

Hi @Shashank47. Rate limits for the Inference API (serverless) reset periodically. PRO and Enterprise organization accounts receive higher rate limits for our serverless inference.

The free Inference API is meant for easily exploring and evaluating models, while Inference Endpoints is our paid inference solution for production use cases. For larger volumes of requests, or if you need guaranteed latency/performance, we recommend using Inference Endpoints instead to deploy your models on dedicated, fully managed infrastructure. Inference Endpoints gives you the flexibility to quickly create endpoints on CPU or GPU resources, and is billed by compute uptime rather than by character usage. Further pricing information can be found here.
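
Since the serverless rate limits reset periodically, one client-side way to cope with an occasional 429 is to wait and retry with exponential backoff. The following is only a minimal sketch, not an official client: the helper names (call_with_backoff, embed), the retry count, and the delays are all hypothetical choices, the token is a placeholder, and whether the API sends a Retry-After header is an assumption (the code falls back to exponential backoff when it is absent).

```python
import json
import time
import urllib.error
import urllib.request

# URL taken from the error message in the question.
API_URL = (
    "https://api-inference.huggingface.co/pipeline/feature-extraction/"
    "sentence-transformers/all-mpnet-base-v2"
)


def embed(sentences, token):
    """Send one feature-extraction request (hypothetical helper)."""
    request = urllib.request.Request(
        API_URL,
        data=json.dumps({"inputs": sentences}).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    # urlopen raises urllib.error.HTTPError for 4xx/5xx responses, including 429.
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def call_with_backoff(send, max_retries=5, base_delay=1.0, sleep=time.sleep):
    """Call send(); on HTTP 429, wait and retry up to max_retries times."""
    for attempt in range(max_retries + 1):
        try:
            return send()
        except urllib.error.HTTPError as err:
            # Re-raise anything that is not a rate limit, or if retries ran out.
            if err.code != 429 or attempt == max_retries:
                raise
            # Assumption: the server may send Retry-After in seconds.
            retry_after = None
            if err.headers is not None:
                retry_after = err.headers.get("Retry-After")
            if retry_after and retry_after.isdigit():
                delay = float(retry_after)
            else:
                # Exponential backoff: base_delay, 2x, 4x, ...
                delay = base_delay * 2 ** attempt
            sleep(delay)
```

It could be used as call_with_backoff(lambda: embed(["some sentence"], token="hf_...")), where the token value is a placeholder for your own access token. Backoff only smooths over short bursts; for sustained volume, the higher limits or Inference Endpoints mentioned above are still the way to go.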