How to download subset of of a dataset scripted

Hi! You can load this version of the dataset with dataset = load_dataset("oscar-corpus/OSCAR-2109", "original_nl", split="train"). Also, this dataset is gated, so you need to log in locally using the huggingface-cli login command before loading the dataset.

2 Likes