Cannot use Hugging Face cache on a read-only filesystem

stodo · January 27, 2025, 10:19pm

Hello,

I’m having some trouble with getting the huggingface cache to work for my particular use case. I’ve been setting up kubernetes training for my transformer models, and the way it works is that pods are initialized from a docker image and then I mount a persistent disk with the data. Since pods are initialized from a docker image, any other files relevant for training (e.g. the training data) must be stored on the persistent disk. This disk is mounted in readonly mode due to certain limitations outside of my control.

Currently, downloading the model checkpoint for whichever architecture I wish to fine-tune eats up a lot of time, so I’d like to take advantage of the HF cache. The problem is that if I do so by downloading the models to a cache folder on the previously mentioned persistent disk, then setting the HF_HOME environment variable to point to that folder, my training runs will crash since huggingface tries to write to the HF_HOME folder for some reason. The model weights are already fully downloaded and have not been updated since, so I’m not really sure why this is the case. I realize that when running AutoModel.from_pretrained I could provide a local filepath instead of the pretrained model name, but then I’m basically managing the cache myself and I’m wondering if instead there is a setting for this.

In short, how can I use cached, already downloaded model weights from a readonly filesystem?

John6666 · January 28, 2025, 3:51am

If the library is writing to the cache itself, there’s nothing you can do…
If the library is only writing to HF_HOME for other processes than caching, you might be able to get around it by using HF_HUB_CACHE instead of HF_HOME. Because HF_HOME has a wide range of effects…

stodo · January 29, 2025, 2:48am

Unfortunately that doesn’t fix the issue. I’m not sure if this is the only such instance in the code, but it looks like the download code just doesn’t consider the case where there might not be write permissions for the cache: see here. I’ll probably just end up making a PR for this.

John6666 · January 29, 2025, 5:28am

Oh, I see. In the case of Python, we tend to use makedirs as if it were a magic spell.

Topic		Replies	Views
PermissionError when writing to HF_HOME directory on startup Spaces	2	2837	May 22, 2024
Unable to write in cache folder Spaces	3	650	December 5, 2024
Permissions of Cache Directories 🤗Hub	2	750	July 17, 2025
How do I customize .cache/huggingface Beginners	2	2877	November 1, 2022
HF cache no space left on device 🤗Datasets	5	243	March 31, 2025

Cannot use Hugging Face cache on a read-only filesystem

Related topics