Filter pruning is widely adopted to compress and accelerate the Convolut...
Cross-speaker style transfer in speech synthesis aims at transferring a ...
Chinese Grapheme-to-Phoneme (G2P) conversion plays an important role ...
Unit type errors, where values with physical unit types (e.g., meters, h...
While transformers and their variant conformers show promising performan...
The creation of long melody sequences requires effective expression of c...
Time Delay Neural Networks (TDNN)-based methods are widely used in diale...
The audio source separation tasks, such as speech enhancement, speech se...
Interactive single-image segmentation is ubiquitous in the scientific an...
In this paper, we present a cross-lingual voice cloning approach. BN fea...
The use of future contextual information is typically shown to be helpfu...