Hugging Face Forums
How to handle big data?
🤗Datasets
lhoestq
May 25, 2023, 3:08pm
7
it can be related to
IndexError: Invalid key: 16 is out of bounds for size 0 - #3 by Isma
show post in topic
Related topics
Topic
Replies
Views
Activity
Dataset.map hangs on tokenization (relatively small dataset)
🤗Datasets
2
2003
April 22, 2022
Using load_dataset.set_transform() function along with Trainer class
🤗Datasets
4
2627
April 26, 2021
Batching vs. Sharding a Large Dataset
🤗Datasets
4
2248
June 8, 2021
Lazy-Loading binarized shard using Hf-dataset for Hf-Trainer
🤗Datasets
4
2526
June 24, 2021
Support of very large dataset?
🤗Datasets
12
10439
August 24, 2022