Hugging Face Course, Chapter 7 (Token Classification): dataset error

When I try to download the CoNLL-2003 dataset, a FileNotFoundError is raised:

from datasets import load_dataset

raw_datasets = load_dataset("conll2003")
FileNotFoundError: Couldn't find file at https://github.com/davidsbatista/NER-datasets/raw/master/CONLL2003/train.txt

I saw that David Batista removed the dataset from its original location on January 14, 2022, which may be what is causing the error.

Here is the link to his commit.

I think you need to upgrade to the latest version of datasets (cc @lhoestq)


Indeed, I just upgraded to version 1.18.1 and the problem was solved.

Thank you very much!
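
For anyone hitting the same error, here is a minimal sketch of how one might check whether the installed datasets version is older than the 1.18.1 release that worked in this thread. The helper names (`parse_version`, `needs_upgrade`, `installed_datasets_version`) and the `KNOWN_GOOD` constant are hypothetical, and treating 1.18.1 as the threshold is an assumption based only on the reply above; the fix may have landed in an earlier release.

```python
from importlib.metadata import PackageNotFoundError, version

# Version reported in this thread as working. Whether an earlier release
# already contained the fix is not confirmed here (assumption).
KNOWN_GOOD = "1.18.1"


def parse_version(text):
    """Turn '1.18.1' into (1, 18, 1); stops at the first non-numeric part."""
    parts = []
    for piece in text.split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def needs_upgrade(installed, known_good=KNOWN_GOOD):
    """Return True when the installed version is older than known_good."""
    return parse_version(installed) < parse_version(known_good)


def installed_datasets_version():
    """Return the installed datasets version string, or None if absent."""
    try:
        return version("datasets")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    current = installed_datasets_version()
    if current is None:
        print("datasets is not installed; run: pip install datasets")
    elif needs_upgrade(current):
        print(f"datasets {current} is older than {KNOWN_GOOD}; "
              "run: pip install --upgrade datasets")
    else:
        print(f"datasets {current} should be recent enough")
```

If the check reports an older version, running `pip install --upgrade datasets` and then retrying `load_dataset("conll2003")` is the step that resolved the error here.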
