Suwon Shon

research

∙ 05/18/2023

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

Conformer, a convolution-augmented Transformer variant, has become the d...

0 Yifan Peng, et al. ∙

research

∙ 12/20/2022

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Spoken language understanding (SLU) tasks have been studied for many dec...

0 Suwon Shon, et al. ∙

research

∙ 12/16/2022

Context-aware Fine-tuning of Self-supervised Speech Models

Self-supervised pre-trained transformers have improved the state of the ...

0 Suwon Shon, et al. ∙

research

∙ 12/14/2021

On the Use of External Data for Spoken Named Entity Recognition

Spoken language understanding (SLU) tasks involve mapping from speech au...

4 Ankita Pasad, et al. ∙

research

∙ 11/19/2021

SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech

Progress in speech processing has been facilitated by shared datasets an...

21 Suwon Shon, et al. ∙

research

∙ 06/11/2021

Leveraging Pre-trained Language Model for Speech Sentiment Analysis

In this paper, we explore the use of pre-trained language models to lear...

0 Suwon Shon, et al. ∙

research

∙ 05/11/2019

Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification

There are a number of studies about extraction of bottleneck (BN) featur...

0 Achintya kr. Sarkar, et al. ∙

research

∙ 04/07/2019

MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation

The Multi-target Challenge aims to assess how well current speech techno...

0 Suwon Shon, et al. ∙

research

∙ 04/07/2019

VoiceID Loss: Speech Enhancement for Speaker Verification

In this paper, we propose VoiceID loss, a novel loss function for traini...

0 Suwon Shon, et al. ∙

research

∙ 12/04/2018

Domain Mismatch Robust Acoustic Scene Classification using Channel Information Conversion

In a recent acoustic scene classification (ASC) research field, training...

0 Seongkyu Mun, et al. ∙

research

∙ 12/04/2018

Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain

End-to-end deep learning language or dialect identification systems oper...

0 Suwon Shon, et al. ∙

research

∙ 11/27/2018

Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion

In this paper, we present a multi-modal online person verification syste...

0 Suwon Shon, et al. ∙

research

∙ 11/27/2018

Large-scale Speaker Retrieval on Random Speaker Variability Subspace

This paper describes a fast speaker search system to retrieve segments o...

0 Suwon Shon, et al. ∙

research

∙ 09/12/2018

Unsupervised Representation Learning of Speech for Dialect Identification

In this paper, we explore the use of a factorized hierarchical variation...

0 Suwon Shon, et al. ∙

research

∙ 09/12/2018

Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model

In this paper, we propose a Convolutional Neural Network (CNN) based spe...

0 Suwon Shon, et al. ∙

research

∙ 07/17/2018

MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System

The Multitarget Challenge aims to assess how well current speech technol...

1 Suwon Shon, et al. ∙

research

∙ 03/12/2018

Convolutional Neural Networks and Language Embeddings for End-to-End Dialect Recognition

Dialect identification (DID) is a special case of general language ident...

0 Suwon Shon, et al. ∙

research

∙ 08/28/2017

MIT-QCRI Arabic Dialect Identification System for the 2017 Multi-Genre Broadcast Challenge

In order to successfully annotate the Arabic speech con- tent found in o...

0 Suwon Shon, et al. ∙

research

∙ 08/11/2017

DNN Transfer Learning based Non-linear Feature Extraction for Acoustic Event Classification

Recent acoustic event classification research has focused on training su...

0 Seongkyu Mun, et al. ∙

research

∙ 08/03/2017

Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition

Recently in speaker recognition, performance degradation due to the chan...

0 Suwon Shon, et al. ∙

research

∙ 08/03/2017

Autoencoder based Domain Adaptation for Speaker Recognition under Insufficient Channel Information

In real-life conditions, mismatch between development and test domain de...

0 Suwon Shon, et al. ∙

research

∙ 02/03/2017

KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation

Korea University Intelligent Signal Processing Lab. (KU-ISPL) developed ...

0 Suwon Shon, et al. ∙

Suwon Shon

Featured Co-authors

Sign in with Google

Consider DeepAI Pro