I’m trying to download a repo from **Hugging Face** using the code below. After 3 hours of running, the repo had not finished downloading and I got this error:
from huggingface_hub import snapshot_download
snapshot_download(repo_id="openclimatefix/era5-land", repo_type="dataset",
cache_dir="/home/saben1/scratch/o/slurms/data_4")
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='cdn-lfs.huggingface.co', port=443): Read timed out.
I added the parameter resume_download=True (to resume the download from where it stopped) and increased etag_timeout, but now I get a different error:
from huggingface_hub import snapshot_download
snapshot_download(repo_id="openclimatefix/era5-land", repo_type="dataset",
cache_dir="/home/saben1/scratch/o/slurms/data_4", etag_timeout=120,
resume_download=True)
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /api/datasets/openclimatefix/era5-land/revision/main (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x2aba8ecc7070>: Failed to establish a new connection: [Errno 101] Network is unreachable'))
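
One workaround I am considering is to wrap the call in a retry loop with exponential backoff, so a dropped connection does not end the whole job. This is only a rough sketch: `retry_with_backoff` and `download` are helper names I made up, not part of huggingface_hub, and the attempt count and delays are guesses. It assumes the failures are transient. If the machine has no outbound network access at all (which "Network is unreachable" might indicate), retrying will not help.

```python
import time


def retry_with_backoff(fn, attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call fn() until it succeeds, waiting base_delay * 2**i after failure i."""
    for i in range(attempts):
        try:
            return fn()
        except OSError:
            # requests' exceptions (ConnectionError, ReadTimeout, ...) derive
            # from OSError, so this catches both errors shown above.
            if i == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** i)


def download():
    # Hypothetical wrapper around the exact call from my post.
    # Imported here so the retry helper itself has no third-party dependency.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id="openclimatefix/era5-land",
        repo_type="dataset",
        cache_dir="/home/saben1/scratch/o/slurms/data_4",
        etag_timeout=120,
        resume_download=True,  # continue partially downloaded files
    )
```

Calling `retry_with_backoff(download, attempts=10, base_delay=30.0)` would then retry up to ten times, waiting 30 s, 60 s, 120 s, and so on between attempts. With resume_download=True, each retry should pick up partially downloaded files rather than start over.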