I think there might have been recent changes to Hugging Face’s free tier limits for inference API. One of my student been using the same model and usage level consistently, so this sudden limit seems like a policy update on their end.
Check and read Hugging Face’s recent announcements or documentation.
Look it might show a billing section in your account dashboard.
If it’s urgent, the PRO subscription might be worth it temporarily until you figure out the details.