There are a couple of different ways to customize the caching directories! You can set the cache for datasets with the `HF_DATASETS_CACHE` environment variable, or pass `cache_dir` as a parameter when you load a dataset. For models, there’s a similar environment variable.
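
Here's a rough sketch of both dataset options, in case it's useful. The cache path and the `load_with_custom_cache` helper are made up for illustration; only `HF_DATASETS_CACHE` and the `cache_dir` parameter come from the description above. As far as I know, the environment variable needs to be set before `datasets` is imported, so the import is deferred inside the helper.

```python
import os
from pathlib import Path

# Hypothetical location; use any directory you have write access to.
CACHE_DIR = Path.home() / "hf_cache" / "datasets"

# Option 1: environment variable. Set it before `datasets` is imported,
# because the library picks it up when it is first loaded.
os.environ["HF_DATASETS_CACHE"] = str(CACHE_DIR)


def load_with_custom_cache(name: str, cache_dir: Path = CACHE_DIR):
    """Option 2: pass cache_dir explicitly for a single load call.

    This helper is illustrative and not part of the datasets library.
    """
    # Imported here so the environment variable above is already in place.
    from datasets import load_dataset

    return load_dataset(name, cache_dir=str(cache_dir))
```

You would then call something like `load_with_custom_cache("your_dataset_name")`, which downloads into the chosen directory instead of the default one. You normally only need one of the two options: the environment variable applies to every load in the process, while `cache_dir` only affects that one call.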
This is a higher-level doc about how caching across the HF ecosystem works, which may or may not be interesting to you.
Hope this helps!