Custom Local Data Loading: generating split with load_dataset() not working: Values in `DatasetDict` should be of type `Dataset` but got type '<class 'datasets.dataset_dict.DatasetDict'

Hey @fkov - could you make sure that each field of your dataset (each value) is a dataset object, not a dataset dict object? You can follow the guide here for creating a custom audio dataset - Create an audio dataset

And then split it into train/test splits with - Main classes

Then your overall dataset will be a dataset dict object (i.e. has splits), with each split a dataset object

1 Like