Yuanbo Hou

research

∙ 08/23/2023

Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning

Sound events in daily life carry rich information about the objective wo...

0 Yuanbo Hou, et al. ∙

research

∙ 10/27/2022

Multi-dimensional Edge-based Audio Event Relational Graph Representation Learning for Acoustic Scene Classification

Most existing deep learning-based acoustic scene classification (ASC) ap...

0 Yuanbo Hou, et al. ∙

research

∙ 10/22/2022

GCT: Gated Contextual Transformer for Sequential Audio Tagging

Audio tagging aims to assign predefined tags to audio clips to indicate ...

0 Yuanbo Hou, et al. ∙

research

∙ 08/03/2022

Audio-visual scene classification via contrastive event-object alignment and semantic-based fusion

Previous works on scene classification are mainly based on audio or visu...

0 Yuanbo Hou, et al. ∙

research

∙ 06/16/2022

Event-related data conditioning for acoustic event classification

Models based on diverse attention mechanisms have recently shined in tas...

0 Yuanbo Hou, et al. ∙

research

∙ 05/01/2022

Relation-guided acoustic scene classification aided with event embeddings

In real life, acoustic scenes and audio events are naturally correlated....

0 Yuanbo Hou, et al. ∙

research

∙ 03/22/2022

CT-SAT: Contextual Transformer for Sequential Audio Tagging

Sequential audio event tagging can provide not only the type information...

0 Yuanbo Hou, et al. ∙

research

∙ 06/21/2021

Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams

Many previous audio-visual voice-related works focus on speech, ignoring...

0 Yuanbo Hou, et al. ∙

research

∙ 10/27/2020

Rule-embedded network for audio-visual voice activity detection in live musical video streams

Detecting anchor's voice in live musical streams is an important preproc...

0 Yuanbo Hou, et al. ∙

research

∙ 08/11/2020

Transfer Learning for Improving Singing-voice Detection in Polyphonic Instrumental Music

Detecting singing-voice in polyphonic instrumental music is critical to ...

0 Yuanbo Hou, et al. ∙

research

∙ 04/27/2019

Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering

Sound event detection (SED) methods typically rely on either strongly la...

0 Yuanbo Hou, et al. ∙

research

∙ 11/17/2018

Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units

Audio tagging aims to detect the types of sound events occurring in an a...

0 Yuanbo Hou, et al. ∙

research

∙ 08/06/2018

Audio Tagging With Connectionist Temporal Classification Model Using Sequential Labelled Data

Audio tagging aims to predict one or several labels in an audio clip. Ma...

0 Yuanbo Hou, et al. ∙

Yuanbo Hou

Featured Co-authors

Sign in with Google

Consider DeepAI Pro