Hugging Face Forums
Slow processing with map when using deepspeed or fairscale
🤗Datasets
jasonyoun
June 25, 2021, 6:13am
11
I was also able to reproduce the result. Thanks for the prompt support, @stas and @sgugger.