Hugging Face Forums
UTF-16 for datasets?
🤗Datasets
mariosasko
June 19, 2023, 7:16pm
You can pass `encoding="utf-16"` to the `load_dataset` call.
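For illustration, here is a minimal sketch of what that could look like. It assumes the data is a plain text file loaded with the generic `"text"` builder (the `"csv"` and `"json"` builders also take an `encoding` argument); the helper names, file name and sample contents are made up for the example.

```python
from pathlib import Path


def write_utf16_sample(path: Path) -> None:
    """Write a small UTF-16 text file (hypothetical sample data)."""
    path.write_text("merhaba dünya\nçığ öğün\n", encoding="utf-16")


def load_utf16_text(path: Path):
    """Load a UTF-16 text file, one example per line, with the "text" builder."""
    # Imported lazily so write_utf16_sample works without `datasets` installed.
    from datasets import load_dataset

    # `encoding` is forwarded to the builder config; its default is "utf-8",
    # which is what raises UnicodeDecodeError on UTF-16 files.
    return load_dataset(
        "text",
        data_files=str(path),
        encoding="utf-16",
        split="train",
    )
```

Calling `write_utf16_sample(Path("sample.txt"))` and then `load_utf16_text(Path("sample.txt"))` should give a dataset with a single `text` column, one row per line of the file.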