This work introduces approaches to assessing phrase breaks in ESL learne...
This paper presents a speech BERT model to extract embedded prosody
info...
Machine Speech Chain, which integrates both end-to-end (E2E) automatic s...
Neural TTS has demonstrated strong capabilities to generate human-like s...
Neural TTS has shown it can generate high quality synthesized speech. In...