Hi all, I’m kind of a beginner with the HF Interface, I was trying to load a 16 MB dataset with arabic characters, and I get the following error: I’m honestly confused what the error is.
0/site-packages/datasets/table.py", line 1833, in wrapper
return func(array, *args, **kwargs)
File “/home/user/.local/lib/python3.10/site-packages/datasets/table.py”, line 2027, in array_cast
return array.cast(pa_type)
File “pyarrow/array.pxi”, line 980, in pyarrow.lib.Array.cast
File “/home/user/.local/lib/python3.10/site-packages/pyarrow/compute.py”, line 403, in cast
return call_function(“cast”, [arr], options, memory_pool)
File “pyarrow/_compute.pyx”, line 572, in pyarrow._compute.call_function
File “pyarrow/_compute.pyx”, line 367, in pyarrow._compute.Function.call
File “pyarrow/error.pxi”, line 144, in pyarrow.lib.pyarrow_internal_check_status
File “pyarrow/error.pxi”, line 100, in pyarrow.lib.check_status
pyarrow.lib.ArrowInvalid: Failed to parse string: ‘17 - “”’ as a scalar of type int64
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File “/home/user/app/app.py”, line 9, in
dataset = load_dataset(‘FDSRashid/hadith_info’,data_files = ‘Basic_Edge_Information.csv’, token = Secret_token, split = ‘train’)
File “/home/user/.local/lib/python3.10/site-packages/datasets/load.py”, line 2153, in load_dataset
builder_instance.download_and_prepare(
File “/home/user/.local/lib/python3.10/site-packages/datasets/builder.py”, line 954, in download_and_prepare
self._download_and_prepare(
File “/home/user/.local/lib/python3.10/site-packages/datasets/builder.py”, line 1049, in _download_and_prepare
self._prepare_split(split_generator, **prepare_split_kwargs)
File “/home/user/.local/lib/python3.10/site-packages/datasets/builder.py”, line 1813, in _prepare_split
for job_id, done, content in self._prepare_split_single(
File “/home/user/.local/lib/python3.10/site-packages/datasets/builder.py”, line 1958, in _prepare_split_single
raise DatasetGenerationError(“An error occurred while generating the dataset”) from e
datasets.builder.DatasetGenerationError: An error occurred while generating the dataset