How to use split_dataset_by_node and shuffle on iterable dataset
|
|
4
|
689
|
September 13, 2025
|
Streaming for Saving
|
|
3
|
53
|
September 12, 2025
|
What’s the definiation of lazy loading? Is IterableDataset also faster than Dataset when loading locally?
|
|
4
|
10
|
September 12, 2025
|
Interest in a Real DeFi Trading Dataset with Microstructure Details?
|
|
4
|
36
|
September 11, 2025
|
Error in https://huggingface.co/learn/llm-course/chapter3/2?fw=pt#preprocessing-a-dataset
|
|
3
|
13
|
September 4, 2025
|
Create batch from list of ids in the dataset is very slow
|
|
5
|
872
|
September 4, 2025
|
How to get approved to get access on OASIS 3 dataset
|
|
0
|
5
|
September 3, 2025
|
Change metadata of parquet files
|
|
3
|
34
|
September 2, 2025
|
Feb 2025 CriteoPrivateAd dataset – when were the logs collected?
|
|
1
|
13
|
September 2, 2025
|
Missing dataset after PapersWithCode migration
|
|
3
|
56
|
August 29, 2025
|
Missing dataset card - Reddit-TIFU dataset
|
|
6
|
27
|
August 23, 2025
|
`save_to_disk` saving ALL data, even items I filtered out
|
|
2
|
19
|
August 21, 2025
|
DatasetInfo seems to be missing when I pull my dataset from HFHub
|
|
2
|
58
|
August 21, 2025
|
[New Dataset Release] Scottish Smallpipes in A (Preview Pack, v0.9)
|
|
0
|
12
|
August 20, 2025
|
How do you collect and structure data for an AI after-sales (SAV) agent in banking/insurance?
|
|
1
|
21
|
August 18, 2025
|
TikTok-10M Dataset
|
|
5
|
63
|
August 17, 2025
|
Error EBUG:filelock:Attempting to acquire lock
|
|
1
|
1070
|
August 15, 2025
|
Dataset flagged as unsafe due to false positive - how to resolve?
|
|
5
|
58
|
August 14, 2025
|
Vector Database
|
|
0
|
27
|
August 7, 2025
|
Error converting np float32
|
|
3
|
23
|
August 5, 2025
|
Looking for datasets with paragraph/scene-level genre labels (e.g., action, romance, dialogue)
|
|
0
|
20
|
August 5, 2025
|
Open Discord Chat Dataset (+ Model): Internet Tone Dataset for LLMs and ML
|
|
0
|
14
|
August 5, 2025
|
Error Uploading Large Folder
|
|
5
|
65
|
August 3, 2025
|
I've built a PDF to Dataset tool with Python would love your feedback
|
|
0
|
30
|
August 3, 2025
|
How to handle the cache system properly?
|
|
3
|
42
|
August 2, 2025
|
AI accidentally deleted my dataset repository
|
|
1
|
34
|
July 31, 2025
|
📢 New Demo Dataset: “DeFi-Behavior 0728” – real on-chain trading episodes, already labeled
|
|
0
|
57
|
July 29, 2025
|
Open-Source Multilingual Knowledge Signal Generator – PROTOCORE (Feedback Welcome!)
|
|
0
|
6
|
July 29, 2025
|
How download CT_DeepLesion-MedSAM2
|
|
1
|
18
|
July 25, 2025
|
Dataset dictionary of lists vs lists of dictionary features
|
|
2
|
26
|
July 23, 2025
|