Self-supervised learning (SSL) for speech representation has been succes...
End-to-end speech summarization (E2E SSum) directly summarizes input spe...
This paper proposes a novel automatic speech recognition (ASR) system th...
Self-supervised learning (SSL) is the latest breakthrough in speech proc...
Self-supervised learning (SSL) has been dramatically successful not only...
This paper proposes a zero-shot text-to-speech (TTS) conditioned by a se...
End-to-end speech summarization (E2E SSum) is a technique to directly ge...
This paper investigates the effectiveness and implementation of modality...
Although recent advances in deep learning technology have boosted automa...
Self-supervised learning (SSL) is seen as a very promising approach with...
Target speech extraction is a technique to extract the target speaker's ...
The combination of a deep neural network (DNN)-based speech enhancement...
We propose cross-modal transformer-based neural correction models that...
Although recent advances in deep learning technology improved automatic ...