[urgent] Can you reconstruct datasets using the cache file (.arrow file)?

Hi,

I built a dataset with the map function in a Jupyter notebook, and I can find the resulting cache file (.arrow). Is there a way to reconstruct the dataset from that file, similar to "datasets.load_from_disk()"?

Sure! If you only have the .arrow file, you can do:

from datasets import Dataset

# path_to_arrow_file is the path to the cached .arrow file
dataset = Dataset.from_file(path_to_arrow_file)

It works! Thank you so much!