Hello team,
I’m using the Inference API to build a simple website. I’m on the Pro plan, and I’d like to know how many minutes the API keeps a model loaded in memory before off-loading it.
For example, when I request a model for the first time, the Inference API loads it into memory — after how many minutes of inactivity will it be off-loaded?
Thanks!