Suppose I load 2 files from the c4 dataset and cache it in a particular folder by doing something like:
dataset = load_dataset("allenai/c4", data_files={"train": ["en/c4-train.00000-of-01024.json.gz", "en/c4-train.00001-of-01024.json.gz"]}, split="train", cache_dir="temp_cache")
Then, I would like to load the same files of the c4 dataset from the cache_dir mentioned above in offline_mode : os.environ["HF_DATASETS_OFFLINE"] = "1"
.
Can you suggest me what needs to be done for the same?