Conformer, a convolution-augmented Transformer variant, has become the d...
Spoken language understanding (SLU) tasks have been studied for many dec...
Self-supervised pre-trained transformers have improved the state of the ...
Spoken language understanding (SLU) tasks involve mapping from speech au...
Progress in speech processing has been facilitated by shared datasets an...
In this paper, we explore the use of pre-trained language models to lear...
There are a number of studies about extraction of bottleneck (BN) featur...
The Multi-target Challenge aims to assess how well current speech techno...
In this paper, we propose VoiceID loss, a novel loss function for traini...
In a recent acoustic scene classification (ASC) research field, training...
End-to-end deep learning language or dialect identification systems oper...
In this paper, we present a multi-modal online person verification syste...
This paper describes a fast speaker search system to retrieve segments o...
In this paper, we explore the use of a factorized hierarchical variation...
In this paper, we propose a Convolutional Neural Network (CNN) based spe...
The Multitarget Challenge aims to assess how well current speech technol...
Dialect identification (DID) is a special case of general language
ident...
In order to successfully annotate the Arabic speech con- tent found in
o...
Recent acoustic event classification research has focused on training
su...
Recently in speaker recognition, performance degradation due to the chan...
In real-life conditions, mismatch between development and test domain
de...
Korea University Intelligent Signal Processing Lab. (KU-ISPL) developed
...