This paper introduces our system designed for Track 2, which focuses on
...
The present paper proposes a waveform boundary detection system for audi...
An automatic speaker verification system aims to verify the speaker iden...
In this paper, we propose an invertible deep learning framework called I...
Confusing-words are commonly encountered in real-life keyword spotting
a...
Modeling voices for multiple speakers and multiple languages in one
text...
High-fidelity speech can be synthesized by end-to-end text-to-speech mod...
This paper describes a conditional neural network architecture for Manda...
In this paper, we apply the NetFV and NetVLAD layers for the end-to-end
...
A novel learnable dictionary encoding layer is proposed in this paper fo...
A novel interpretable end-to-end learning scheme for language identifica...