LLM tutorial 7 classical NLP task

raw_datasets = load_dataset(“conll2003”)

this code is not working. Error message is “Dataset scripts are no longer supported, but found conll2003.py”. How can I solve this problem? Please let me know.

1 Like

Dataset scripts are no longer supported

The cause is literally as stated, so either pin the datasets library to an older version or try the code below (using a dataset without builder scripts).

raw_datasets = load_dataset("lhoestq/conll2003")
1 Like
Thank you for answering my question. 
I have another question: Why doesn't the error disappear when I add the word lhoestq/ to the code?
1 Like

Yeah. When specifying conll2003 (the abbreviated notation without author name), eriktks/conll2003 is probably loaded implicitly. However, specifying lhoestq/conll2003 loads a different dataset without the data builder script.

The dataset contents themselves are probably identical.