Topic | Replies | Views | Activity
Add new feature without changing number or rows | 2 | 803 | May 18, 2022
Max individual file size for LFS files is 46.6GB | 2 | 3200 | May 19, 2022
Creating a SyntaxGym dataset -- structure and evaluation questions | 3 | 702 | May 24, 2022
Centralized Benchmarks | 2 | 494 | May 24, 2022
Accessing dataset is very slow compared to torchvision | 2 | 1315 | May 24, 2022
Create wikitext2 dataset offline | 0 | 635 | May 24, 2022
Can't run trainer.predict(). ValueError: 'process_id' should be a number greater than 0 | 2 | 1368 | May 28, 2022
Allow streaming of large datasets with image/audio | 18 | 3980 | May 30, 2022
Working with large datasets - cache issues | 1 | 1037 | June 1, 2022
How to clip audio files in an audio dataset? | 1 | 486 | June 1, 2022
Class Labels for Custom Datasets | 4 | 18118 | June 2, 2022
Is there any way in which i can convert my CSV data directly to Conll2003 format? | 1 | 1259 | June 2, 2022
Loading Huge Image Dataset seems to take a lot of time | 7 | 3765 | May 16, 2022
Using a custom metric on the Huggingface Hub | 1 | 1543 | June 3, 2022
In-memory dataset to disk for caching operations | 1 | 938 | May 2, 2022
Running out of Diskspace | 1 | 3110 | April 26, 2022
Issue with Custom Nested Metrics | 1 | 884 | November 5, 2021
Set_transform and group_by_length=True | 3 | 3299 | June 10, 2021
Does load_dataset load the data in to the memory? | 1 | 502 | February 22, 2021
Extend load_from_disk and save_to_disk to remote storage | 3 | 527 | October 12, 2020
DPR Context tokenization in a GPU | 4 | 1182 | September 25, 2020
MyPy and DatasetDict. Error: Incompatible return value type (got "Union[DatasetDict, Dataset, IterableDatasetDict, IterableDataset]", expected "DatasetDict") | 2 | 1085 | April 26, 2022
Wav2vec2 pretraining on own wav files | 2 | 1019 | April 24, 2022
How to use several datasets that fit into the RAM? | 1 | 502 | November 5, 2021
Datasets map is slower than pandas apply | 0 | 1119 | April 23, 2022
Dataset.map hangs on tokenization (relatively small dataset) | 2 | 2005 | April 22, 2022
Most efficient way to retrieve N rows for a subset of columns | 2 | 1535 | November 3, 2021
Map method to tokenize raises index error | 9 | 4289 | June 9, 2021
When calling load_metric ('rouge') what file is downloaded (and where do I find it)? | 1 | 1895 | April 22, 2022
Map() function doesn't process | 2 | 1084 | April 21, 2022