How to download data from hugging face that is visible on the data viewer but the files are not available?

Hi @brando,

you can get the parquet files for every config by clicking Auto-converted to Parquet.

Capture d’écran 2023-08-14 à 12.42.33

For example, for the hacker_news train split, it would send to EleutherAI/pile at refs/convert/parquet.

Also note that if you click on API,

Capture d’écran 2023-08-14 à 12.51.46

you have access to the REST API endpoints

So you can download:

1 Like