Hi Everyone,
I created several configs, one per language, for this dataset: EuropeanParliament/cellar_eurovoc
configs:
- config_name: default
  data_files: cellar.jsonl.gz
- config_name: en-GB
  data_files: cellar_en.jsonl.gz
- config_name: fr-FR
  data_files: cellar_fr.jsonl.gz
They appear correctly (more or less) on the website, but not in the code:
from datasets import get_dataset_config_names
get_dataset_config_names("EuropeanParliament/cellar_eurovoc")
I still get only one config.
Any idea?
Hello @severo, it looks like we worked in the same team at IRISA. Small world!
See you
Hi! The "dataset configs in the YAML" feature requires the latest release of datasets (>=2.14.0), so updating the installation with pip install -U datasets should fix the problem.
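For anyone who wants to confirm the version requirement before retrying, here is a minimal sketch using only the standard library. The helper names (`parse_version`, `supports_yaml_configs`, `installed_datasets_version`) are made up for illustration and are not part of datasets; the 2.14.0 threshold is the one quoted above.

```python
import re
from importlib import metadata

# Minimum release quoted in the reply above for YAML-defined configs.
MIN_VERSION = (2, 14, 0)


def parse_version(version: str) -> tuple:
    """Turn '2.14.4' into (2, 14, 4), ignoring suffixes such as 'rc1' or '.dev0'."""
    numbers = []
    for part in version.split(".")[:3]:
        match = re.match(r"\d+", part)
        numbers.append(int(match.group()) if match else 0)
    # Pad short versions such as '3' or '2.14' with zeros.
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def supports_yaml_configs(version: str) -> bool:
    """True if the given version string is at least MIN_VERSION."""
    return parse_version(version) >= MIN_VERSION


def installed_datasets_version():
    """Return the installed datasets version string, or None if it is not installed."""
    try:
        return metadata.version("datasets")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_datasets_version()
    if found is None:
        print("datasets is not installed in this environment")
    elif supports_yaml_configs(found):
        print(f"datasets {found} should be recent enough")
    else:
        print(f"datasets {found} is too old; try: pip install -U datasets")
```

If this reports a version that is recent enough but the configs are still missing, the installation is probably not the cause.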
Thanks, I updated it, but I may have a cache issue now:
import datasets
datasets.__version__
'2.14.4'
and
from datasets import get_dataset_config_names
get_dataset_config_names("EuropeanParliament/cellar_eurovoc")
which, if you want to test, gives:
AttributeError: 'NoneType' object has no attribute 'BUILDER_CONFIG_CLASS'
Try passing download_mode="force_redownload" to redownload the files or, if this doesn't help, cache_dir="path/to/new/cache_dir" to use a different directory for caching.
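The two suggestions above could be wired together roughly as follows. This is only a sketch: `config_lookup_kwargs` and `list_configs` are hypothetical helper names, and it assumes that `get_dataset_config_names` accepts `download_mode` and `cache_dir` as keyword arguments, as the reply suggests. The import is done lazily because the call needs datasets installed and network access.

```python
def config_lookup_kwargs(force_redownload=False, cache_dir=None):
    """Build the optional keyword arguments suggested in the reply above."""
    kwargs = {}
    if force_redownload:
        # Ignore previously cached files and fetch them again.
        kwargs["download_mode"] = "force_redownload"
    if cache_dir is not None:
        # Use a fresh directory instead of the default cache.
        kwargs["cache_dir"] = str(cache_dir)
    return kwargs


def list_configs(repo_id, force_redownload=False, cache_dir=None):
    """List config names for a Hub dataset, optionally bypassing the cache."""
    # Imported here so the helper above stays usable without datasets installed.
    from datasets import get_dataset_config_names

    options = config_lookup_kwargs(force_redownload, cache_dir)
    return get_dataset_config_names(repo_id, **options)


# Example calls (need network access, so they are left commented out):
# list_configs("EuropeanParliament/cellar_eurovoc", force_redownload=True)
# list_configs("EuropeanParliament/cellar_eurovoc", cache_dir="path/to/new/cache_dir")
```

Trying the forced redownload first and switching to a new cache directory only if that fails matches the order suggested above.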