Hi, I’m trying to load dataset mteb/amazon_massive_scenario of an earlier commit. But couldn’t do so because load_dataset(‘mteb/amazon_massive_scenario’, revision = commit_id) is returning FileNotFoundError.
The log is
Using the latest cached version of the module from /home/ubuntu/.cache/huggingface/modules/datasets_modules/datasets/mteb--amazon_massive_scenario/9da724b00b2a1038e22834c484b75d1fb0c54d5371d51858a1124c03af30f2d4 (last modified on Wed Apr 10 17:00:09 2024) since it couldn't be found locally at mteb/amazon_massive_scenario, or remotely on the Hugging Face Hub.
File {my script}, line 350, in loading_ds
ds = load_dataset(id, config_name, split=split, revision=commit_id)
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/load.py", line 2574, in load_dataset
builder_instance.download_and_prepare(
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/builder.py", line 1005, in download_and_prepare
self._download_and_prepare(
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/builder.py", line 1767, in _download_and_prepare
super()._download_and_prepare(
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/builder.py", line 1078, in _download_and_prepare
split_generators = self._split_generators(dl_manager, **split_generators_kwargs)
File "/home/ubuntu/.cache/huggingface/modules/datasets_modules/datasets/mteb--amazon_massive_scenario/9da724b00b2a1038e22834c484b75d1fb0c54d5371d51858a1124c03af30f2d4/amazon_massive_scenario.py", line 129, in _split_generators
archive_path = dl_manager.download(_URL)
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/download/download_manager.py", line 434, in download
downloaded_path_or_paths = map_nested(
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/utils/py_utils.py", line 459, in map_nested
return function(data_struct)
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/download/download_manager.py", line 459, in _download
out = cached_path(url_or_filename, download_config=download_config)
File "/home/ubuntu/.venv/lib/python3.10/site-packages/datasets/utils/file_utils.py", line 210, in cached_path
raise FileNotFoundError(f"Local file {url_or_filename} doesn't exist")
My huggingface hub version is 0.23.2
Has anyone encountered similar issue?
Many thanks!!