API inference limit changed?

Hi,

I had the same experience. I had been using LLaMa-3.3-70B for several months through a PRO subscription. Each day I compare summarization results on news stories (40-70 stories/day, 700-2,500 tokens each) across different models/APIs: GPT-4o, Gemini, LLaMa-3.3-70B, etc.

When I got rate-limited, I opened a second account to see what the “shadow charge” on PRO users was. Over two days, I used up the $2 credit after around 80 stories. The equivalent charge from OpenAI was ~$0.40.
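For anyone curious, here's the back-of-envelope math behind those numbers (the per-story figures are just my total spend divided by my ~80 stories, not official pricing):

```python
# Rough cost-per-story comparison from my own usage numbers above.
# These are averages over ~80 news stories (700-2,500 tokens each),
# not published API rates.

def per_story_cost(total_spend: float, stories: int) -> float:
    """Average cost per summarized story."""
    return total_spend / stories

llama_cost = per_story_cost(2.00, 80)   # $2 credit burned over ~80 stories
openai_cost = per_story_cost(0.40, 80)  # equivalent OpenAI charge

print(f"LLaMa-3.3-70B via PRO: ${llama_cost:.4f}/story")
print(f"OpenAI:                ${openai_cost:.4f}/story")
print(f"Ratio: {llama_cost / openai_cost:.1f}x")
# → roughly $0.0250/story vs $0.0050/story, a 5x difference
```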

Is the coming PRO “pay as you go” likely to be that high?

Thanks, -Charlie Dolan

PS Really miss using LLaMa-3.3-70B because it was very often right on the mark summarizing long, discursive news analysis and blog posts when GPT-4o, Sonnet 3.5, and Gemini 2.0 all whiffed.