We consider speech enhancement for signals picked up in one noisy enviro...
Audio and visual modalities are inherently connected in speech signals: ...
In the context of keyword spotting (KWS), the replacement of handcrafted...
The intelligibility and quality of speech from a mobile phone or public...
Spoken keyword spotting (KWS) deals with the identification of keywords ...
In this paper, we propose a method to estimate the proximity of an acous...
This paper considers speech enhancement of signals picked up in one nois...
In recent years, speech processing algorithms have seen tremendous progr...
In this paper, we present a deep-learning-based framework for audio-visu...
Speech enhancement and speech separation are two related tasks, whose pu...
Despite their great performance over the years, handcrafted speech featu...
Both acoustic and visual information influence human perception of speec...
Many deep learning-based speech enhancement algorithms are designed to m...
Keyword spotting (KWS) is experiencing an upswing due to the pervasivene...
When speaking in the presence of background noise, humans reflexively change...
Humans tend to change their way of speaking when they are immersed in a ...
Audio-visual speech enhancement (AV-SE) is the task of improving speech...
One of the biggest challenges in multi-microphone applications is the es...
Although speech enhancement algorithms based on deep neural networks (DN...
The recently proposed relaxed binaural beamforming (RBB) optimization pr...
From the eardrum to the auditory cortex, where acoustic stimuli are deco...
In this paper we propose a Deep Neural Network (DNN) based Speech Enhanc...
In this paper we propose to use utterance-level Permutation Invariant Tr...
We propose a novel deep learning model, which supports permutation invar...