Sound events in daily life carry rich information about the objective wo...
Spiking neural networks (SNN) are a promising research avenue for buildi...
Most existing deep learning-based acoustic scene classification (ASC)
ap...
Audio tagging aims to assign predefined tags to audio clips to indicate ...
Previous works on scene classification are mainly based on audio or visu...
Models based on diverse attention mechanisms have recently shined in tas...
In real life, acoustic scenes and audio events are naturally correlated....
The information of spiking neural networks (SNNs) are propagated between...
Sequential audio event tagging can provide not only the type information...
Many previous audio-visual voice-related works focus on speech, ignoring...
Detecting anchor's voice in live musical streams is an important
preproc...