b'Taesu Kim'

research

∙ 07/03/2023

Squeezing Large-Scale Diffusion Models for Mobile

The emergence of diffusion models has greatly broadened the scope of hig...

0 Jiwoong Choi, et al. ∙

research

∙ 06/04/2023

OWQ: Lessons learned from activation outliers for weight quantization in large language models

Large language models (LLMs) with hundreds of billions of parameters sho...

0 Changhun Lee, et al. ∙

research

∙ 03/15/2023

Cross-speaker Emotion Transfer by Manipulating Speech Style Latents

In recent years, emotional text-to-speech has shown considerable progres...

0 Suhee Jo, et al. ∙

research

∙ 11/07/2022

Hi,KIA: A Speech Emotion Recognition Dataset for Wake-Up Words

Wake-up words (WUW) is a short sentence used to activate a speech recogn...

0 Taesu Kim, et al. ∙

research

∙ 09/22/2022

Affective Role of the Future Autonomous Vehicle Interior

Recent advancements in autonomous technology allow for new opportunities...

0 Taesu Kim, et al. ∙

research

∙ 09/22/2022

Affective responses to chromatic ambient light in a vehicle

This study investigates the emotional responses to the color of vehicle ...

0 Taesu Kim, et al. ∙

research

∙ 07/13/2022

Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS

Expressive text-to-speech has shown improved performance in recent years...

0 Yookyung Shin, et al. ∙

research

∙ 07/05/2022

GP22: A Car Styling Dataset for Automotive Designers

An automated design data archiving could reduce the time wasted by desig...

0 Gyunpyo Lee, et al. ∙

research

∙ 10/06/2021

EdiTTS: Score-based Editing for Controllable Text-to-Speech

We present EdiTTS, an off-the-shelf speech editing methodology based on ...

4 Jaesung Tae, et al. ∙

research

∙ 11/27/2018

Large-scale Speaker Retrieval on Random Speaker Variability Subspace

This paper describes a fast speaker search system to retrieve segments o...

0 Suwon Shon, et al. ∙

research

∙ 11/23/2018

Learning pronunciation from a foreign language in speech synthesis networks

Although there are more than 65,000 languages in the world, the pronunci...

0 Younggun Lee, et al. ∙

research

∙ 11/06/2018

Robust and fine-grained prosody control of end-to-end speech synthesis

We propose prosody embeddings for emotional and expressive speech synthe...

0 Younggun Lee, et al. ∙

research

∙ 06/04/2018

Voice Imitating Text-to-Speech Neural Networks

We propose a neural text-to-speech (TTS) model that can imitate a new sp...

0 Younggun Lee, et al. ∙

research

∙ 03/30/2017

Deep Neural Network Optimized to Resistive Memory with Nonlinear Current-Voltage Characteristics

Artificial Neural Network computation relies on intensive vector-matrix ...

0 Hyungjun Kim, et al. ∙

Taesu Kim

Featured Co-authors

Sign in with Google

Consider DeepAI Pro