Emotional and controllable speech synthesis is a topic that has received...
Generally speaking, the main objective when training a neural speech
syn...
As the recently proposed voice cloning system, NAUTILUS, is capable of
c...
We introduce a novel speech synthesis system, called NAUTILUS, that can
...
Voice conversion (VC) and text-to-speech (TTS) are two tasks that share ...
By representing speaker characteristics as a single fixed-length vector
e...
When the available data of a target speaker is insufficient to train a h...
This paper proposes a new architecture for speaker adaptation of
multi-s...
We investigated the impact of noisy linguistic features on the performan...
Recent neural networks such as WaveNet and SampleRNN that learn directly...
Most neural-network-based speaker-adaptive acoustic models for speech
sy...