| Topic | Replies | Views | Activity |
|---|---|---|---|
| Looking for Mental Health Support Datasets for building a Multi-turn Chatbot | 6 | 2822 | September 21, 2024 |
| Debugging parallel Datasets transformations | 3 | 2094 | December 17, 2022 |
| Dataset map and flatten | 5 | 3032 | October 12, 2020 |
| Couldn't find 'my_dataset' on the Hugging Face Hub | 4 | 3313 | May 2, 2023 |
| Best Practices for Large-Scale Image Datasets? (between WebDataset and Parquet) | 3 | 370 | February 8, 2025 |
| KeyError: 'data' | 3 | 3677 | February 22, 2023 |
| Pretrained model 'Helsinki-NLP/opus-mt-en-ar' is not available in TFAutoModelForSeq2SeqLM | 3 | 1157 | January 24, 2022 |
| Datasets filter/map hangs when multithreading | 8 | 2438 | May 2, 2023 |
| PyTorch Dataset/DataLoader classes | 3 | 1150 | November 25, 2021 |
| Load_from_disk and read-only filesystem | 5 | 2926 | September 21, 2023 |
| Making multiple samples from single samples using HuggingFace Datasets | 6 | 2709 | March 3, 2024 |
| Saving a dataset to disk after select copies the data | 8 | 2324 | April 7, 2022 |
| Convert from HF audio dataset to raw audio file | 1 | 865 | November 22, 2023 |
| .get_nearest_examples() throws ArrowInvalid: offset overflow while concatenating arrays | 4 | 3062 | September 30, 2020 |
| Can't automatically load_dataset due to network | 1 | 4833 | April 7, 2022 |
| Iterable datasets features | 5 | 2786 | September 8, 2022 |
| Mapping 1 multi-element column of a dataset to multi row dataset with 1 element per row, duplicating other features | 6 | 2580 | November 4, 2022 |
| Iterating over Dataset with type='torch' columns | 1 | 2710 | December 5, 2022 |
| Custom Dataset Creation Guidance For Resume Parsing | 0 | 1189 | October 30, 2023 |
| Does datasets.load_dataset not support a seed? | 8 | 2213 | March 24, 2023 |
| One of my datasets was marked unsafe | 6 | 2509 | March 16, 2023 |
| Multiple call datasets.load_from_disk() cause Memory Leak! | 2 | 1209 | February 15, 2022 |
| New dataset raises 'UnexpectedSplits:' error | 5 | 2702 | August 12, 2022 |
| Set_transform and group_by_length=True | 3 | 3297 | June 10, 2021 |
| Streaming datasets and batched mapping | 5 | 2689 | January 10, 2022 |
| Duckdb cli: what's the URL to access to a dataset? | 4 | 927 | August 18, 2023 |
| Dataset preview doesn't working: "The split does not contain any rows." | 3 | 1035 | January 12, 2023 |
| How to run image classification on image url | 5 | 2670 | July 21, 2022 |
| Convert dataframe to NER dataset format | 1 | 1453 | May 16, 2022 |
| Nested datasets and oversampling | 5 | 2638 | July 5, 2021 |
| Num_worker with IterableDataset | 4 | 2885 | November 16, 2023 |
| botocore.exceptions.ClientError: An error occurred (AuthorizationHeaderMalformed) while pushing to the hub | 2 | 3707 | September 6, 2022 |
| Add_column() does not work if used on dataset sliced with select() | 2 | 657 | January 19, 2022 |
| Any good datasets related to creative writing (books/novels)? | 0 | 1135 | August 18, 2022 |
| Export own dataset with different feature types to TFRecord | 6 | 1347 | April 17, 2023 |
| get batch indices when iterating DataLoader over a Dataset | 1 | 4465 | July 20, 2021 |
| Is from_generator() caching? how to stop it? | 2 | 648 | June 27, 2025 |
| Saving train/val/test datasets | 2 | 3580 | August 25, 2021 |
| Joining datasets by column & best practices for multi-view datasets | 3 | 3085 | May 13, 2024 |
| Keeping IterableDataset node-wise split fixed during DDP | 8 | 2044 | April 29, 2024 |
| Can I make the interleave dataset for the longest one | 1 | 1367 | August 12, 2022 |
| Error when fine-tuning with the Trainer API | 4 | 2719 | December 10, 2021 |
| Pushing dataset images to Hub | 4 | 2697 | October 25, 2022 |
| A couple of questions about interleave_datasets() | 7 | 2090 | March 28, 2024 |
| How to use Dataset with Pytorch Lightning | 1 | 4144 | April 13, 2021 |
| Best practice for saving large datasets to a cloud storage | 5 | 2358 | April 3, 2024 |
| `train_test_split` with IterableDataset | 2 | 1875 | January 26, 2023 |
| Datasets map modifying audio array to list? | 1 | 1281 | November 29, 2021 |
| Dataset preview not showing for uploaded DatasetDict | 6 | 2148 | December 7, 2021 |
| Arrowmemoryerror: realloc of size 32 GB failed | 2 | 3277 | January 6, 2023 |