This paper describes the NPU-MSXF system for the IWSLT 2023 speech-to-sp...
This paper aims to synthesize target speaker's speech with desired speak...
End-to-end singing voice synthesis (SVS) model VISinger can achieve bett...
Recent development of neural vocoders based on the generative adversaria...
In current two-stage neural text-to-speech (TTS) paradigm, it is ideal t...
Speaker adaptation in text-to-speech synthesis (TTS) is to finetune a
pr...
Building a high-quality singing corpus for a person who is not good at
s...
This paper introduces Opencpop, a publicly available high-quality Mandar...
In this paper, we propose VISinger, a complete end-to-end high-quality
s...