Dataset repo requires arbitrary Python code execution

kargaranamir · October 21, 2023, 12:44am

The viewer is disabled because this dataset repo requires arbitrary Python code execution. Please consider removing the loading script and relying on automated data support. If this is not possible, please open a discussion for direct help.

Why? It worked well before. By the way, load_dataset works, and the viewer was working yesterday as well!

The dataset:

unwilledset · October 21, 2023, 5:12am

I have the exact same problem and cannot tell where the error is, given I have not changed anything and this used to work. Hope we can get some assistance.

MajdTannous · October 21, 2023, 8:45am

I have the exact same problem

kargaranamir · October 21, 2023, 8:58am

@lhoestq, care to join? I don’t know who to tag.

lhoestq · October 21, 2023, 10:08am

We had to disable the viewer for datasets with a script for now, because some people were abusing it. Sorry for the inconvenience.

We’re seeing if there is a viable long-term solution.

In the meantime, if you want the dataset viewer to work, you need to remove the dataset script and use a supported data format (CSV, Parquet, etc.). Personally, I’d recommend uploading the dataset using the datasets library and push_to_hub().
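A minimal sketch of that route, assuming the `datasets` library is installed, you are logged in to the Hub, and your installed version can still run loading scripts. The function name `migrate_to_data_files` and the repo ids are hypothetical placeholders, not something from this thread.

```python
def migrate_to_data_files(source_repo: str, target_repo: str) -> None:
    """Load a script-based dataset once, then re-upload it as data files.

    Both arguments are Hub repo ids such as "username/my_dataset"
    (hypothetical placeholders; use your own).
    """
    # Imported lazily so the function can be defined without the library.
    from datasets import load_dataset

    # This step still runs the loading script, so do it on a machine you trust.
    # Some later releases require passing trust_remote_code=True here.
    dataset = load_dataset(source_repo)

    # push_to_hub() uploads the data as Parquet files, which the viewer can
    # read without executing any code.
    dataset.push_to_hub(target_repo)


# Example call (not executed here):
# migrate_to_data_files("username/my_dataset", "username/my_dataset")
```

If the target is the same repo as the source, the loading script itself still has to be deleted from the repo afterwards, as described above.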

julien-c · October 23, 2023, 8:31am

If you have a popular dataset (> 100 likes or downloads) that is affected, please let us know here; we can allowlist popular datasets.

Thanks.

kargaranamir · October 26, 2023, 10:50am

I don’t have that much publicity. I was hoping to gain some :))
I fixed one of my datasets, deleted the other one, and I’m still trying to figure out what’s wrong with the third one: the viewer doesn’t work, even though I removed the loading script and moved to the automatic Hugging Face structure.

julien-c · October 26, 2023, 12:18pm

cc @severo, who may be able to help with that last one!

lhoestq · October 26, 2023, 12:21pm

The viewer is being created for your third dataset; it will be available soon.

severo · October 26, 2023, 12:44pm

Nice

lampent · November 26, 2023, 5:34pm

Hi @julien-c, @lhoestq

Is it possible to allow my dataset “lampent/IRFL” to use the dataset viewer with a script?
In my previous work (“nlphuji/vasr”), it worked great, and I want to customize this dataset as well.

Thank you.
Ron.

severo · November 28, 2023, 11:29am

I see you made it work using data-only files, congrats!

severo · November 28, 2023, 11:29am

We recently updated the docs to make it easier to structure your dataset without a dataset script:

lampent · November 30, 2023, 11:23am

Yes, and it works great! However, I would like to use the dataset viewer with a script to enable image display instead of strings.

As you can see, the fields “distractors” and “answer” are actually image identifiers, and I would like to load them into the dataset viewer as images. The only method I am aware of is the one we used in “nlphuji/vasr” (see image below).

severo · November 30, 2023, 12:05pm

Indeed. We have an issue to handle that case, feel free to chime in, or +1.

github.com/huggingface/datasets

Multi-image loading in Imagefolder dataset

opened 04:01PM - 16 Apr 23 UTC

vvvm23

enhancement

### Feature request

Extend the `imagefolder` dataloading script to support loading multiple images per dataset entry. This only really makes sense if a metadata file is present.

Currently you can use the following format (example `metadata.jsonl`):

```
{'file_name': 'path_to_image.png', 'metadata': ...}
...
```

which will return a batch with key `image` and any other metadata.

I would propose extending `file_name` to also accept a list of files, which would return a batch with key `images` and any other metadata.

### Motivation

This is useful for example in segmentation tasks in computer vision models, or in text-to-image models that also accept conditioning signals such as another image, feature map, or similar. Currently if I want to do this, I would need to write a custom dataset, rather than just use `imagefolder`.

### Your contribution

Would be open to doing a PR, but also happy for someone else to take it as I am not familiar with the datasets library.
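For reference, the single-image `metadata.jsonl` format the issue describes can be generated with the standard library alone. This is a rough sketch: the helper name `write_metadata`, the folder name, and the `caption` column are hypothetical, and it covers only the existing one-image-per-row layout, not the multi-image extension being requested. Note that a real JSON Lines file uses double-quoted keys, which `json.dumps` produces.

```python
import json
from pathlib import Path


def write_metadata(folder, rows):
    """Write one JSON object per line to <folder>/metadata.jsonl.

    Each row must contain a "file_name" key naming an image file
    relative to the folder; any other keys become extra columns.
    """
    folder = Path(folder)
    folder.mkdir(parents=True, exist_ok=True)
    path = folder / "metadata.jsonl"
    with path.open("w", encoding="utf-8") as f:
        for row in rows:
            if "file_name" not in row:
                raise ValueError("each row needs a 'file_name' key")
            f.write(json.dumps(row, ensure_ascii=False) + "\n")
    return path


# Example usage with hypothetical file names (not executed here):
# write_metadata("my_images", [
#     {"file_name": "0001.png", "caption": "a palm print"},
#     {"file_name": "0002.png", "caption": "another palm print"},
# ])
# The folder can then be loaded with
# load_dataset("imagefolder", data_dir="my_images").
```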

leosocy · April 7, 2024, 6:42am

Hi @severo @lhoestq

Is it possible to allow my dataset “leosocy/palmnet” to use the dataset viewer with a script?

Thank you.
Leosocy.

Bakerbunker · April 26, 2024, 9:19am

Hi, @severo @lhoestq
We have a dataset, Wenetspeech4TTS/WenetSpeech4TTS. Is it possible to auto-convert this dataset to Parquet and enable the dataset viewer?

severo · April 26, 2024, 9:38am

The latest datasets release (2.19.0) provides a CLI tool to convert a dataset to data-only (Parquet) files; see the “Command Line Interface (CLI)” page in the docs.

Please tell us if it works well for you!

Bakerbunker · July 3, 2024, 6:31am

Hi, we are the owners of Wenetspeech4TTS/WenetSpeech4TTS. We tried to use the datasets CLI to convert the dataset, but in China we are facing network connection issues.

severo · July 8, 2024, 8:40am

cc @Wauplin @lhoestq maybe
