Too many open files on big datasets
|
|
3
|
184
|
September 30, 2024
|
'NoneType' object is not subscriptable in .map() writing step
|
|
1
|
99
|
September 30, 2024
|
How to handle IterableDataset with HuggingFace trainer and num_workers in DDP setup
|
|
5
|
2932
|
September 26, 2024
|
Iterable of batches from IterableDataset
|
|
1
|
169
|
September 24, 2024
|
Looking for Mental Health Support Datasets for building a Multi-turn Chatbot
|
|
6
|
2378
|
September 21, 2024
|
Mentail health counseling
|
|
2
|
214
|
September 18, 2024
|
How do I download and load a dataset in batches without caching all of it?
|
|
1
|
224
|
September 16, 2024
|
Unexpected keyword 'promote_options' error
|
|
3
|
547
|
September 14, 2024
|
FAQ about datasets size
|
|
1
|
39
|
September 12, 2024
|
Can not load_dataset
|
|
2
|
75
|
September 6, 2024
|
Strange "safe" key missing error
|
|
4
|
54
|
September 6, 2024
|
Banking 77 Dataset
|
|
0
|
35
|
September 6, 2024
|
Json dump format for load_dataset
|
|
5
|
21815
|
September 5, 2024
|
[Bug?] Datasets map and concatenation after sharding OOM
|
|
1
|
31
|
September 4, 2024
|
Error Iterating over KeyDataset
|
|
0
|
30
|
August 30, 2024
|
KeyError: '__index_level_0__' error with datasets arrow_writer.py
|
|
3
|
8528
|
August 29, 2024
|
Downloading Large Dataset to HDFS: Issues with save_to_disk Method
|
|
0
|
94
|
August 29, 2024
|
ORPO/DPO dataset clarification
|
|
3
|
334
|
August 29, 2024
|
Filter Large Dataset Entry by Entry
|
|
7
|
164
|
August 28, 2024
|
BERT embeddings on big dataset
|
|
3
|
123
|
August 28, 2024
|
Why have all these projects been suspended
|
|
0
|
22
|
August 26, 2024
|
Recovering IterableDataset state if it crashes mid stream
|
|
0
|
30
|
August 22, 2024
|
Get sample index within dataasets' mapping function
|
|
0
|
36
|
August 22, 2024
|
Uploading 3D Numpy Array Dataset
|
|
3
|
166
|
August 21, 2024
|
Turn of automatic Pil image generation in load_dataset
|
|
2
|
32
|
August 21, 2024
|
Questions about Dataset.map()
|
|
6
|
86
|
August 20, 2024
|
Error thread 'polars' panicked when reading dataset using polars
|
|
2
|
335
|
August 19, 2024
|
Issue with iterable dataset that is stuck on StopIteration
|
|
4
|
217
|
August 19, 2024
|
HF Datasets are being corrupted
|
|
1
|
27
|
August 19, 2024
|
Using num_proc>1 in Dataset.map hangs
|
|
8
|
3947
|
August 19, 2024
|