Chutong Meng | DeepAI

Featured Co-authors

PetsTime
196 publications
Haizhou Li
150 publications
Yuexian Zou
85 publications
Wenwu Wang
79 publications
Mark D. Plumbley
72 publications
Qiuqiang Kong
62 publications
Mingxuan Wang
58 publications
Haohe Liu
26 publications
Xinhao Mei
19 publications
Tom Ko
18 publications
Jun Cao
17 publications

research

∙ 08/31/2023

RepCodec: A Speech Representation Codec for Speech Tokenization

With recent rapid growth of large language models (LLMs), discrete speec...

Zhichao Huang, et al.

research

∙ 03/30/2023

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

The advancement of audio-language (AL) multimodal learning tasks has bee...

Xinhao Mei, et al.

research

∙ 10/08/2022

CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning

Speech is the surface form of a finite set of phonetic units, which can ...

Chutong Meng, et al.

research

∙ 04/08/2022

GigaST: A 10,000-hour Pseudo Speech Translation Corpus

This paper introduces GigaST, a large-scale pseudo speech translation (S...

Rong Ye, et al.
