In expressive speech synthesis it is widely adopted to use latent prosod...
Artificial speech synthesis has made a great leap in terms of naturalnes...
Whilst recent neural text-to-speech (TTS) approaches produce high-qualit...
This paper proposes a general enhancement to the Normalizing Flows (NF) ...
We present a universal neural vocoder based on Parallel WaveNet, with an...
Prosody Transfer (PT) is a technique that aims to use the prosody from a...
We present a neural text-to-speech system for fine-grained prosody trans...
Pitch detection is a fundamental problem in speech processing as F0 is u...
Statistical TTS systems that directly predict the speech waveform have
r...