Various applications of voice synthesis have been developed independentl...
Self-supervised learning (SSL) has drawn increased attention in the f...
In this paper, we propose a novel unsupervised text-to-speech (UTTS) fra...
Despite the rapid progress in automatic speech recognition (ASR) researc...
Acoustic echo cancellation (AEC) plays an important role in the full-dup...
Disentangling content and speaking style information is essential for ze...
In this paper, we present a novel framework that jointly performs speake...
Traditional studies on voice conversion (VC) have made progress with par...
Conversational bilingual speech encompasses three types of utterances: t...
Objective speech quality assessment is usually conducted by comparin...
In this study, we investigate self-supervised representation learning fo...
Target-speaker speech recognition aims to recognize target-speaker speec...
Singing voice conversion converts the timbre in the source singing ...
In this work, we propose minimum Bayes risk (MBR) training of RNN-Transd...
The I4U consortium was established to facilitate a joint entry to NIST s...
This study presents systems submitted by the University of Texas at Dall...
This document briefly describes the systems submitted by the Center for ...