JSON parse error when trying to load my own SQuAD dataset

Hi, I’m trying to follow this notebook but I get stuck at loading my SQuAD dataset.

dataset = load_dataset('json', data_files={'train': 'squad/nl_squad_train_clean.json', 'test': 'squad/nl_squad_train_clean.json'}, field='data')

Gives the following error ArrowInvalid: JSON parse error: Column(/paragraphs/[]/qas/[]/answers/[]) changed from object to array in row 0.

Please let me know what I have to do to fix this, if needed I can post the complete stack trace.

Thanks.

1 Like