Chinese text to speech

Please recommend high-quality text-to-speech models for the Chinese language. Or maybe good speaker embeddings.
I do not speak Chinese myself so I will appreciate your opinion on the recommended model/embedding performance.
We tried 11labs with a pretrained voice but feedback was that quality was not good enough (wrong tones, etc.)