Unable to download large datasets

Hi, I have been trying to download the DROID dataset using huggingface-cli, both from

and
datasets/IPEC-COMMUNITY/droid_lerobot

However, I cannot download the full dataset. All of the parquet files are downloaded, but only the first three of the 100 video chunks.

Alternatively, I have tried git clone, but I get the following error:
git clone git@hf.co:datasets/cadene/droid_1.0.1

panic: runtime error: index out of range [0] with length 0

goroutine 124 [running]:
github.com/git-lfs/git-lfs/tq.(*basicDownloadAdapter).download(0xc000290348, 0xc00a70a900, 0xc000110ce0, 0x0, 0xc00e373f58, 0x0, {0xb4ce40, 0xc011c47c00})
github.com/git-lfs/git-lfs/tq/basic_download.go:156 +0xceb
github.com/git-lfs/git-lfs/tq.(*basicDownloadAdapter).DoTransfer(0xc000290348, {0x40?, 0x0?}, 0xc00a70a900, 0xc000110ce0, 0x0)
github.com/git-lfs/git-lfs/tq/basic_download.go:96 +0x42d
github.com/git-lfs/git-lfs/tq.(*adapterBase).worker(0xc0006042d0, 0x7, {0x0, 0x0})
github.com/git-lfs/git-lfs/tq/adapterbase.go:183 +0x597
created by github.com/git-lfs/git-lfs/tq.(*adapterBase).Begin in goroutine 79
github.com/git-lfs/git-lfs/tq/adapterbase.go:96 +0x27a
error: external filter 'git-lfs filter-process' failed
fatal: videos/chunk-040/observation.images.exterior_2_left/episode_040994.mp4: smudge filter lfs failed
warning: Clone succeeded, but checkout failed.
You can inspect what was checked out with 'git status'
and retry with 'git restore --source=HEAD :/'

I used both huggingface-cli and git clone, on multiple machines, but the behaviour persists.
Any idea what is going on?


Hmm… This looks like a git-lfs issue.
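
If git-lfs is the culprit, one thing that might be worth trying is to skip git entirely and fetch the videos one chunk at a time through the huggingface_hub Python library, retrying each chunk on failure. Below is a rough sketch, not a confirmed fix. The names chunk_patterns and download_chunks are hypothetical helpers made up for illustration. The videos/chunk-NNN/ layout is taken from the error message above, and the numbering 000 to 099 is an assumption based on the "100 chunks" mentioned in the question. Whether this gets past the three-chunk limit seen with huggingface-cli is untested.

```python
import time


def chunk_patterns(first: int, last: int) -> list[str]:
    """Build one glob pattern per video chunk, first..last inclusive.

    Assumes the repo layout videos/chunk-NNN/... seen in the error message.
    """
    return [f"videos/chunk-{i:03d}/*" for i in range(first, last + 1)]


def download_chunks(repo_id: str, local_dir: str, first: int, last: int,
                    retries: int = 3) -> None:
    """Download video chunks one by one, retrying each chunk a few times."""
    # Imported lazily so chunk_patterns works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    for pattern in chunk_patterns(first, last):
        for attempt in range(1, retries + 1):
            try:
                # Files already present in local_dir are skipped, so reruns
                # only fetch what is still missing.
                snapshot_download(
                    repo_id=repo_id,
                    repo_type="dataset",
                    local_dir=local_dir,
                    allow_patterns=[pattern],
                )
                break
            except Exception as err:
                print(f"{pattern}: attempt {attempt} failed: {err}")
                if attempt == retries:
                    raise
                time.sleep(5 * attempt)  # simple linear backoff before retrying


# Example call (assumes chunks are numbered 000 to 099):
# download_chunks("cadene/droid_1.0.1", "droid_1.0.1", first=0, last=99)
```

Going chunk by chunk keeps each request small, so a failure points at a specific chunk instead of aborting the whole transfer, and rerunning the script should pick up where it stopped.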
