While the performance of cross-lingual TTS based on monolingual corpora ...
Automatic dubbing, which generates a corresponding version of the input
...
Although deep learning and end-to-end models have been widely used and s...
For conversational text-to-speech (TTS) systems, it is vital that the sy...
Semantic information of a sentence is crucial for improving the
expressi...
Factorizing speech as disentangled speech representations is vital to ac...
Syntactic structure of a sentence text is correlated with the prosodic
s...