Found a fix here: "UTF-16 for datasets? - #2 by mariosasko". I passed encoding='UTF-8' to my load_dataset call and it worked.