Chapter 7 questions

Same error here…Will there be an alternative dataset for the notebook on summarization?

Since that dataset itself no longer seems to be functional, I think using another similar dataset is the simplest workaround

yes i had a similar issue and couldnt find an equally good dataset either

For those who has issues running:

from datasets import load_dataset

raw_datasets = load_dataset(“conll2003”)

Try

from datasets import load_dataset

raw_datasets = load_dataset(“conll2003”, revision=“refs/convert/parquet”)

print(raw_datasets)

If u have issues loading kde4 dataset use opus100 instead

from datasets import load_dataset

# raw_datasets = load_dataset("kde4", lang1="en", lang2="fr",
#     trust_remote_code=True)
raw_datasets = load_dataset("opus100", "en-fr")