Chenxu Hu | DeepAI

Featured Co-authors

Tie-Yan Liu (176 publications)
Lei Xie (137 publications)
Tao Qin (134 publications)
Zhou Zhao (125 publications)
Yi Ren (101 publications)
Tao Li (100 publications)
Jie Fu (99 publications)
Xu Tan (86 publications)
Hang Zhao (78 publications)
Yuxuan Wang (65 publications)
Sheng Zhao (36 publications)

research

∙ 09/02/2023

DiCLET-TTS: Diffusion Model based Cross-lingual Emotion Transfer for Text-to-Speech – A Study between English and Mandarin

While the performance of cross-lingual TTS based on monolingual corpora ...

Tao Li, et al.

research

∙ 06/29/2023

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

The Video-to-Audio (V2A) model has recently gained attention for its pra...

Simian Luo, et al.

research

∙ 06/06/2023

ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory

Large language models (LLMs) with memory are computationally universal. ...

Chenxu Hu, et al.

research

∙ 08/02/2022

ViP3D: End-to-end Visual Trajectory Prediction via 3D Agent Queries

Existing autonomous driving pipelines separate the perception module fro...

Junru Gu, et al.

research

∙ 07/13/2022

Controllable and Lossless Non-Autoregressive End-to-End Text-to-Speech

Some recent studies have demonstrated the feasibility of single-stage ne...

Zhengxi Liu, et al.

research

∙ 10/15/2021

Neural Dubber: Dubbing for Videos According to Scripts

Dubbing is a post-production process of re-recording actors' dialogues, ...

Chenxu Hu, et al.

research

∙ 11/02/2020

CVC: Contrastive Learning for Non-parallel Voice Conversion

Cycle consistent generative adversarial network (CycleGAN) and variation...

Tingle Li, et al.

research

∙ 06/08/2020

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

Advanced text to speech (TTS) models such as FastSpeech can synthesize s...

Yi Ren, et al.
