Dataset viewer crashes after generating parquet files from convert_to_parquet

Hi all!
I’m relatively new to the usage of HuggingFace, and recently uploaded a dataset to HuggingFace. Because I wanted to use custom config names and splits, I wrote a dataset loading script extending the datasets.BuilderConfig and datasets.GeneratorBasedBuilder classes to my specific use cases. I used the datasets-cli test loader_script.py function to make sure everything worked as expected, then used the datasts-cli convert_to_parquet function to 1) convert my dataset to a parquet format, 2) enable the dataviewer and 3) delete the loading script on the main repository.

However, my dataset viewer is currently crashing. Everything else works as expected, including the function load_dataset. Could anyone help me fix the dataviewer on my repo? Thanks so much!!

1 Like

I don’t know what it is, but is there a parquet file that has too large row groups for DatasetViewer?