I am using NLLB models from Hugging Face in my application. Currently the model is loaded on every API call, which results in high execution time.
I need to keep the loaded model in something like a cache so it can be reused across API calls. Does Hugging Face provide any built-in caching and locking functionality for loaded models, or do I need to implement this myself?
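For context, this is roughly the pattern I would otherwise hand-roll: a process-level cache with double-checked locking so the model is loaded at most once and shared by all request handlers. The `loader` callable here is illustrative; in my app it would wrap the actual `from_pretrained` calls for my NLLB checkpoint.

```python
import threading

_lock = threading.Lock()
_cache = {}

def get_cached(key, loader):
    """Thread-safe lazy singleton: loader() runs at most once per key."""
    if key in _cache:               # fast path, no lock once populated
        return _cache[key]
    with _lock:
        if key not in _cache:       # double-check under the lock
            _cache[key] = loader()
        return _cache[key]
```

Usage in a request handler would then look something like (model name illustrative):

```python
tok_and_model = get_cached(
    "nllb",
    lambda: (
        AutoTokenizer.from_pretrained("facebook/nllb-200-distilled-600M"),
        AutoModelForSeq2SeqLM.from_pretrained("facebook/nllb-200-distilled-600M"),
    ),
)
```

Is there something in `transformers` (or an official companion library) that already does this, so I don't have to maintain it myself?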