Same error here…Will there be an alternative dataset for the notebook on summarization?
yes i had a similar issue and couldnt find an equally good dataset either
For those who has issues running:
from datasets import load_dataset
raw_datasets = load_dataset(“conll2003”)
Try
from datasets import load_dataset
raw_datasets = load_dataset(“conll2003”, revision=“refs/convert/parquet”)
print(raw_datasets)
If u have issues loading kde4 dataset use opus100 instead
from datasets import load_dataset
# raw_datasets = load_dataset("kde4", lang1="en", lang2="fr",
# trust_remote_code=True)
raw_datasets = load_dataset("opus100", "en-fr")