Sharing downloaded models between users

dimidd · March 19, 2024, 12:03pm

I’d like several users to share downloaded models, such that when any of the users downloads a model, e.g. using

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

the other users would be able to use it as well for inference, without having to download it again. The users would all be using linux, but may use different hosts, have different python environments and package versions, etc. So, I’ve thought creating a shared NFS mount, e.g. /models and mount it on all hosts. Then, for each user, symlink their HF cache hub dir to a shared path. E.g. ln -s /models ~/.cache/huggingface/hub.

I don’t want to symlink ~/.cache/huggingface/, since it also contains a personal HF token, and modules.

Assuming we can configure file permissions properly, could there still be issues such as:

conflicts between different versions of packages, virtualenv/conda envs etc.
file locking issues

cduk · March 21, 2024, 1:08pm

Did you implement this yet? I was thinking of doing something similar as downloading the same multi-gigabyte models on different computers in the network is not fun.

Topic		Replies	Views
Download most used models in container and load them when necessary 🤗Hub	0	1275	September 29, 2023
How to chose the platform functionality Beginners	0	13	August 7, 2024
AutoModel resolution outside of HF ecosystem 🤗Transformers	3	540	February 1, 2021
Cannot use Hugging Face cache on a read-only filesystem 🤗Transformers	3	250	January 29, 2025
Manual model download, and then move to HF cache Beginners	3	1226	September 15, 2024

Sharing downloaded models between users

Related topics