Can I use a pickle file with the data_files argument with datasets?

I’m trying to use datasets to download a .pickle from the hub. I’m using the following snippet

dataset = load_dataset("SauravMaheshkar/Actor", data_files="processed/actor.pickle")

I’m running into the following error

Full Error Trace
FileNotFoundError: Couldn't find a dataset script at /content/SauravMaheshkar/Actor/Actor.py or any data file in the same directory. Couldn't find 'SauravMaheshkar/Actor' on the Hugging Face Hub either: FileNotFoundError: Unable to find 'hf://datasets/SauravMaheshkar/Actor@4c4b589ee6aff3eca6b0368a045c1bafc6cbcd51/processed/actor.pickle' with any supported extension

I think you can use huggingface_hub, instead of datasets: Download files from the Hub

1 Like

Thank you, exactly what I needed !!

This topic was automatically closed 12 hours after the last reply. New replies are no longer allowed.