This work builds on a previous work on unsupervised speech enhancement u...
In this paper, we present a multimodal and dynamical VAE (MDVAE)
applied...
The dynamical variational autoencoders (DVAEs) are a family of
latent-va...
Several recent studies have tested the use of transformer language model...
Understanding and controlling latent representations in deep generative
...
We propose a computational model of speech production combining a pre-tr...
In this paper, we present an unsupervised probabilistic model and associ...
This article is a survey on deep learning methods for single and multipl...
In this work, we propose a novel self-attention based neural network for...
Dynamical variational auto-encoders (DVAEs) are a class of deep generati...
The Variational Autoencoder (VAE) is a powerful deep generative model th...
In this work, we propose to extend a state-of-the-art multi-source
local...
It is increasingly considered that human speech perception and productio...
The prosody of a spoken word is determined by its surrounding context. I...
Speaker counting is the task of estimating the number of people that are...
This paper addresses the problem of sound-source localization (SSL) with...
In incremental text to speech synthesis (iTTS), the synthesizer produces...
The Variational Autoencoder (VAE) is a powerful deep generative model th...
Speaker counting is the task of estimating the number of people that are...
This paper presents a generative approach to speech enhancement based on...
Variational auto-encoders (VAEs) are deep generative latent variable mod...
This paper addresses the problem of under-determinded speech source
sepa...
We propose a method using a long short-term memory (LSTM) network to est...
This paper focuses on single-channel semi-supervised speech enhancement....
In this paper we address the problem of enhancing speech signals in nois...
This paper addresses the problem of multichannel online dereverberation....
This paper presents an online multiple-speaker localization and tracking...
In this paper we address speaker-independent multichannel speech enhance...
In this paper we address the problem of tracking multiple speakers via t...
This paper addresses the problem of online multiple-speaker localization...
This study investigates the use of non-linear unsupervised dimensionalit...
This paper addresses the problem of speech separation and enhancement fr...
This paper addresses the problem of audio source recovery from multichan...