Can you replace data_dir= with data_files= in the last load_dataset call and try again?
data_dir=
data_files=
load_dataset