Soo-Young Lee

research

∙ 12/07/2021

Multi-speaker Emotional Text-to-speech Synthesizer

We present a methodology to train our multi-speaker emotional text-to-sp...

0 Sungjae Cho, et al. ∙

research

∙ 11/26/2020

Unigram-Normalized Perplexity as a Language Model Performance Measure with Different Vocabulary Sizes

Although Perplexity is a widely used performance metric for language mod...

0 Jihyeon Roh, et al. ∙

research

∙ 09/18/2020

Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models

We report a GPT-based multi-sentence language model for dialogue generat...

0 Jihyeon Roh, et al. ∙

research

∙ 03/14/2020

Semi-supervised Disentanglement with Independent Vector Variational Autoencoders

We aim to separate the generative factors of data into two latent vector...

8 Bo-Kyeong Kim, et al. ∙

research

∙ 11/11/2019

Emotional Voice Conversion using multitask learning with Text-to-speech

Voice conversion (VC) is a task to transform a person's voice to differe...

0 Tae-Ho Kim, et al. ∙

research

∙ 06/13/2019

Adjusting Pleasure-Arousal-Dominance for Continuous Emotional Text-to-speech Synthesizer

Emotion is not limited to discrete categories of happy, sad, angry, fear...

0 Azam Rabiee, et al. ∙

research

∙ 11/06/2018

Unpaired Speech Enhancement by Acoustic and Adversarial Supervision for Speech Recognition

Many speech enhancement methods try to learn the relationship between no...

0 Geonmin Kim, et al. ∙

research

∙ 10/12/2018

A Fully Time-domain Neural Model for Subband-based Speech Synthesizer

This paper introduces a deep neural network model for subband-based spee...

0 Azam Rabiee, et al. ∙

research

∙ 09/04/2018

End-to-end Multimodal Emotion and Gender Recognition with Dynamic Joint Loss Weights

Multi-task learning is a method for improving the generalizability of mu...

0 Myungsu Chae, et al. ∙

research

∙ 09/04/2018

End-to-end Multimodal Emotion and Gender Recognition with Dynamic Weights of Joint Loss

Multi-task learning (MTL) is one of the method for improving generalizab...

0 Myungsu Chae, et al. ∙

research

∙ 06/04/2018

Voice Imitating Text-to-Speech Neural Networks

We propose a neural text-to-speech (TTS) model that can imitate a new sp...

0 Younggun Lee, et al. ∙

research

∙ 11/15/2017

Emotional End-to-End Neural Speech Synthesizer

In this paper, we introduce an emotional speech synthesizer based on the...

0 Younggun Lee, et al. ∙

research

∙ 06/10/2016

Deep CNNs along the Time Axis with Intermap Pooling for Robustness to Spectral Variations

Convolutional neural networks (CNNs) with convolutional and pooling oper...

0 Hwaran Lee, et al. ∙

research

∙ 05/02/2016

Compositional Sentence Representation from Character within Large Context Text

This paper describes a Hierarchical Composition Recurrent Network (HCRN)...

0 Geonmin Kim, et al. ∙

Soo-Young Lee

Featured Co-authors

Sign in with Google

Consider DeepAI Pro