How to pass a pipeline over a dataset with multiple columns
|
|
4
|
1057
|
September 6, 2023
|
How to write a dataset load script using private S3 storage
|
|
2
|
1361
|
December 1, 2022
|
How can I parallelize a metric?
|
|
3
|
1173
|
December 8, 2021
|
Slow in generating train split when loading local dataset
|
|
1
|
1659
|
January 12, 2024
|
Weird example of batching in Dataset.map document
|
|
4
|
1047
|
September 4, 2023
|
Calling Silero VAD model from dataset.map
|
|
2
|
1346
|
October 12, 2023
|
Dataset_infos.json getting cached?
|
|
2
|
1344
|
August 4, 2022
|
Saving custom dataset does not finish
|
|
3
|
1159
|
March 16, 2023
|
Progress bar of dataset.map with num_proc>1 hangs
|
|
2
|
1330
|
December 6, 2023
|
Add Sequence(feature=ClassLabel(...), ...) to an existing dataset
|
|
1
|
1625
|
May 2, 2022
|
Describe a nullable/optional column in dataset loading script
|
|
3
|
1142
|
November 12, 2021
|
Accessing dataset is very slow compared to torchvision
|
|
2
|
1315
|
May 24, 2022
|
Dataset map during runtime
|
|
2
|
1311
|
September 13, 2023
|
Sharing the cache folder
|
|
1
|
1595
|
December 12, 2022
|
`load_dataset`: how to extract only the validation split?
|
|
2
|
1290
|
March 15, 2023
|
RuntimeError: Tensors of the same index must be on the same device and the same dtype except `step` tensors that can be CPU and float32 notwithstanding
|
|
0
|
396
|
May 26, 2024
|
Dataset Object without ClassLabel
|
|
3
|
1103
|
March 8, 2023
|
Should I shard dataset in distributed training?
|
|
2
|
714
|
December 3, 2021
|
Dataset only have n_shard=1 when has multiple shards in repo
|
|
1
|
1549
|
July 1, 2022
|
Using a custom metric on the Huggingface Hub
|
|
1
|
1543
|
June 3, 2022
|
Datasets mapper hanging issue
|
|
2
|
1255
|
March 8, 2023
|
Error thread 'polars' panicked when reading dataset using polars
|
|
2
|
396
|
August 19, 2024
|
How to create dataset from github
|
|
2
|
1251
|
March 1, 2024
|
Packing multiple samples into context window
|
|
1
|
1531
|
January 12, 2024
|
Create batch from list of ids in the dataset is very slow
|
|
5
|
874
|
September 4, 2025
|
Unable to upload large audio dataset using push_to_hub
|
|
5
|
872
|
November 17, 2023
|
How to get maximum and minimum value of features?
|
|
1
|
1508
|
March 31, 2022
|
Can we download dataset from folder of text file
|
|
2
|
1229
|
January 18, 2022
|
Undesired behavior when using load_dataset
|
|
4
|
951
|
April 17, 2023
|
Unable to load mozilla-foundation/common_voice_6_0 dataset
|
|
2
|
1221
|
April 4, 2022
|
Why our dataset have unsafe files?
|
|
6
|
796
|
September 25, 2023
|
How to change the format of a dataset
|
|
3
|
1051
|
November 3, 2022
|
All of my datasets disappeared, Why?
|
|
5
|
482
|
April 12, 2024
|
Keywords/tags for searchability of dataset
|
|
2
|
1212
|
April 20, 2023
|
Could I download the dataset manually?
|
|
1
|
1475
|
January 24, 2022
|
Help on using OpenWebText dataset
|
|
2
|
1200
|
October 18, 2022
|
Generating Vocabulary using Datasets
|
|
1
|
1463
|
August 30, 2022
|
How to resolve file paths in a downloaded dataset?
|
|
4
|
919
|
March 20, 2024
|
Custom dataset and cast_column
|
|
1
|
1450
|
April 7, 2022
|
Cannot Download Pile
|
|
1
|
1449
|
October 24, 2023
|
Problems while filtering large datasets using `map`
|
|
2
|
661
|
July 21, 2021
|
Similarity Search in FAISS Returning Raw, Unintelligible Data
|
|
2
|
117
|
January 8, 2025
|
What's the data format of the QA json file in official scripts
|
|
5
|
825
|
February 24, 2023
|
datasets.Dataset.sort() does not preserve ordering
|
|
2
|
1163
|
January 16, 2023
|
I want train my own model speech recognation localy on my data my voice how to do that I can't find start I need very help
|
|
0
|
358
|
December 7, 2021
|
I uploaded a dataset through huggface web interface. But i can't load it!
|
|
3
|
1006
|
May 14, 2023
|
Explain why datasets.map is faster compared to other similar libraries
|
|
4
|
893
|
September 6, 2022
|
Saving outcomes if Error while applying map function on dataset
|
|
2
|
1150
|
February 14, 2023
|
Error while downloading my dataset
|
|
2
|
1144
|
June 21, 2023
|
How to load local dataset
|
|
1
|
1400
|
May 2, 2023
|