Add Sequence(feature=ClassLabel(...), ...) to an existing dataset | 1 | 261 | May 2, 2022
Converting string label to int | 4 | 2594 | May 2, 2022
In-memory dataset to disk for caching operations | 1 | 191 | May 2, 2022
`push_to_hub` a dataset dict with subsets and splits (e.g., GLUE) | 2 | 277 | April 26, 2022
Running out of Diskspace | 1 | 267 | April 26, 2022
MyPy and DatasetDict. Error: Incompatible return value type (got "Union[DatasetDict, Dataset, IterableDatasetDict, IterableDataset]", expected "DatasetDict") | 2 | 232 | April 26, 2022
Wav2vec2 pretraining on own wav files | 2 | 274 | April 24, 2022
Datasets map is slower than pandas apply | 0 | 211 | April 23, 2022
Dataset.map hangs on tokenization (relatively small dataset) | 2 | 324 | April 22, 2022
When calling load_metric ('rouge') what file is downloaded (and where do I find it)? | 1 | 226 | April 22, 2022
Map() function doesn't process | 2 | 279 | April 21, 2022
Map multiprocessing Issue | 18 | 2740 | April 19, 2022
Common Voice 8.0.0 en using all available RAM | 6 | 380 | April 19, 2022
BigPatent - cased version | 2 | 494 | April 14, 2022
Split DataFrame into validation and train split | 2 | 339 | April 11, 2022
Saving a dataset to disk after select copies the data | 8 | 387 | April 7, 2022
Flatten List of features | 1 | 469 | April 7, 2022
Download only a subset of a split | 4 | 334 | April 7, 2022
UnicodeDecodeError when loading Multi Lingual text file | 1 | 458 | April 7, 2022
Can't automatically load_dataset due to network | 1 | 364 | April 7, 2022
Pushing multiple splits of dataset to a single repo of Hub | 1 | 467 | April 7, 2022
Representing nested dictionary with different keys | 5 | 314 | April 7, 2022
Custom dataset and cast_column | 1 | 298 | April 7, 2022
Best practice loading images files | 2 | 348 | April 7, 2022
Format requirements of dataset when fine tuning another model | 1 | 247 | April 7, 2022
Initializing splits from existing Dataset objects | 1 | 354 | April 7, 2022
Unable to load mozilla-foundation/common_voice_6_0 dataset | 2 | 398 | April 4, 2022
Loading multiple serialized datasets with `multiprocessing` | 2 | 327 | April 2, 2022
How to get maximum and minimum value of features? | 1 | 359 | March 31, 2022
Map on Open Web Text consumes all RAM memory | 1 | 383 | March 31, 2022