Dataset format for fine-tuning a pretrained facebook-mms-tts model

Hey, I am trying to fine-tune a pre-trained TTS model, facebook-mms-tts-urd-script_arabic. It is a sub-model of facebook-mms-tts that produces Urdu speech as output.
Now I want to fine-tune this model on another language with my own dataset, but I don't know what format the dataset should be in.
It would really help me to know the expected dataset format.