Error while downloading a repo from Hugging Face : Read timed out

I’m trying to download a repo from **huggingface ** using the code below. After 3 hours of running, the repo wasn’t completely downloaded and I got this error.

from huggingface_hub import snapshot_download

snapshot_download(repo_id="openclimatefix/era5-land", repo_type="dataset", 
cache_dir="/home/saben1/scratch/o/slurms/data_4")

requests.exceptions.ConnectionError: HTTPSConnectionPool(host='cdn-lfs.huggingface.co', port=443): Read timed out.

I added the parameter resume_download=True (to begin downloading from where it stops) and increased the etag_timeout, but still having a new error.

from huggingface_hub import snapshot_download

snapshot_download(repo_id="openclimatefix/era5-land", repo_type="dataset", 
cache_dir="/home/saben1/scratch/o/slurms/data_4", etag_timeout=120, 
resume_download=True)

requests.exceptions.ConnectionError: HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /api/datasets/openclimatefix/era5-land/revision/main (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x2aba8ecc7070>: Failed to establish a new connection: [Errno 101] Network is unreachable'))

5 Likes

Any solution?

If the issue persists, please report it in the huggingface_hub repo.