Most existing keyword spotting research focuses on conditions with sligh...
Although deep learning is the mainstream method in unsupervised anomalou...
The lack of data and the difficulty of multimodal fusion have always bee...
Speech emotion recognition is a challenge and an important step towards ...
Most recent research about automatic music transcription (AMT) uses
conv...
Recently, speech enhancement (SE) based on deep speech prior has attract...
Deep neural network with dual-path bi-directional long short-term memory...
Deep neural network with dual-path bi-directional long short-term memory...
Deep neural network with dual-path bi-directional long short-term memory...
Recently, data-driven based Automatic Speech Recognition (ASR) systems h...
It is an effective way that improves the performance of the existing
Aut...
One of the biggest challenges of acoustic scene classification (ASC) is ...
In this paper, we propose a new strategy for acoustic scene classificati...
The temporal dynamics and the discriminative information in the audio si...
Deep dilated temporal convolutional networks (TCN) have been proved to b...
Deep gated convolutional networks have been proved to be very effective ...
Short-time Fourier transform (STFT) is used as the front end of many pop...
In recent years, monaural speech separation has been formulated as a
sup...
In recent years, monaural speech separation has been formulated as a
sup...
This paper studies the recovery guarantees of the models of minimizing
X...