TTS that makes separate mp3 files per line of text in text file

ChuckBaggett · May 25, 2025, 12:02pm

I’m wondering if anyone knows of a TTS model on HF that can write its output as separate audio files, one file for each line of a text file.

John6666 · May 25, 2025, 12:28pm

I have seen several models that can synthesize long texts by dividing them into chunks, but if you want to divide the text at specific points, it is more reliable to preprocess the text yourself and pass the pieces to the TTS model one at a time or as a batch. In this case, you can write a loop that splits the text into lines, passes each line to the model, and saves each result to its own file.
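As a rough illustration of that loop, here is a minimal sketch. It is not tied to any particular model: `placeholder_tts` is a hypothetical stand-in that only produces silent WAV data so the script runs with the standard library alone, and the function names and the `line_0001.wav` naming scheme are my own assumptions. You would swap in a real TTS call that returns audio bytes.

```python
import io
import wave
from pathlib import Path
from typing import Callable, List


def split_lines(text: str) -> List[str]:
    """Return the non-empty lines of the text, stripped of surrounding whitespace."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def placeholder_tts(text: str) -> bytes:
    """Hypothetical stand-in for a real TTS model: silent WAV, 0.05 s per character."""
    sample_rate = 16000
    n_frames = (sample_rate // 20) * len(text)
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * n_frames)
    return buf.getvalue()


def synthesize_per_line(
    text_path,
    out_dir,
    tts: Callable[[str], bytes] = placeholder_tts,
) -> List[Path]:
    """Write one audio file per non-empty line of the text file."""
    lines = split_lines(Path(text_path).read_text(encoding="utf-8"))
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    paths = []
    for i, line in enumerate(lines, start=1):
        # Zero-padded index keeps the files in line order when sorted by name.
        path = out / f"line_{i:04d}.wav"
        path.write_bytes(tts(line))
        paths.append(path)
    return paths
```

Calling `synthesize_per_line("script.txt", "out")` would write `out/line_0001.wav`, `out/line_0002.wav`, and so on, skipping blank lines. The sketch writes WAV because the standard library cannot encode MP3; if you need MP3, you could convert the files afterwards with an external tool such as ffmpeg.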

Speaker (voice) consistency across the separate files may be an issue, but models such as Dia may be able to maintain it to some extent. (I have not tried it myself, though.)
