Helin Wang

research

∙ 06/05/2023

Benchmarking Large Language Models on CMExam – A Comprehensive Chinese Medical Exam Dataset

Recent advancements in large language models (LLMs) have transformed the...

0 Junling Liu, et al. ∙

research

∙ 11/04/2022

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS

Expressive text-to-speech (TTS) can synthesize a new speaking style by i...

0 Dongchao Yang, et al. ∙

research

∙ 07/20/2022

Diffsound: Discrete Diffusion Model for Text-to-sound Generation

Generating sound effects that humans want is an important topic. However...

0 Dongchao Yang, et al. ∙

research

∙ 05/23/2022

Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection

The past ten years have witnessed the rapid development of text-based in...

0 Peilin Zhou, et al. ∙

research

∙ 04/27/2022

Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training

Transformer-based models attain excellent results and generalize well wh...

0 Dading Chong, et al. ∙

research

∙ 04/05/2022

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection

Target sound detection (TSD) aims to detect the target sound from a mixt...

0 Dongchao Yang, et al. ∙

research

∙ 04/05/2022

A Two-student Learning Framework for Mixed Supervised Target Sound Detection

Target sound detection (TSD) aims to detect the target sound from mixtur...

0 Dongchao Yang, et al. ∙

research

∙ 04/02/2022

Improving Target Sound Extraction with Timestamp Information

Target sound extraction (TSE) aims to extract the sound part of a target...

0 Helin Wang, et al. ∙

research

∙ 12/19/2021

Detect what you want: Target Sound Detection

Human beings can perceive a target sound that we are interested in from ...

0 Dongchao Yang, et al. ∙

research

∙ 10/12/2021

Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information

Automated audio captioning (AAC) has developed rapidly in recent years, ...

0 Zhongjie Ye, et al. ∙

research

∙ 10/09/2021

A Mutual learning framework for Few-shot Sound Event Detection

Although prototypical network (ProtoNet) has proved to be an effective m...

0 Dongchao Yang, et al. ∙

research

∙ 07/04/2021

Audio-Oriented Multimodal Machine Comprehension: Task, Dataset and Model

While Machine Comprehension (MC) has attracted extensive research intere...

0 Zhiqi Huang, et al. ∙

research

∙ 05/21/2021

Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification

It is well known that the mismatch between training (source) and test (t...

0 Dongchao Yang, et al. ∙

research

∙ 04/08/2021

Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency

Transformer-based self-supervised models are trained as feature extracto...

0 Jinchuan Tian, et al. ∙

research

∙ 03/31/2021

SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification

In this paper, we present SpecAugment++, a novel data augmentation metho...

0 Helin Wang, et al. ∙

research

∙ 02/03/2021

A Global-local Attention Framework for Weakly Labelled Audio Tagging

Weakly labelled audio tagging aims to predict the classes of sound event...

0 Helin Wang, et al. ∙

research

∙ 07/06/2020

Acoustic Scene Classification with Spectrogram Processing Strategies

Recently, convolutional neural networks (CNN) have achieved the state-of...

0 Helin Wang, et al. ∙

research

∙ 12/14/2019

Learning discriminative and robust time-frequency representations for environmental sound classification

Convolutional neural networks (CNN) are one of the best-performing neura...

0 Helin Wang, et al. ∙

Helin Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro