Laurent Girin

research

∙ 06/13/2023

Unsupervised speech enhancement with deep dynamical generative speech and noise models

This work builds on a previous work on unsupervised speech enhancement u...

0 Xiaoyu Lin, et al. ∙

research

∙ 05/05/2023

A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning

In this paper, we present a multimodal and dynamical VAE (MDVAE) applied...

0 Samir Sadok, et al. ∙

research

∙ 03/07/2023

Speech Modeling with a Hierarchical Transformer Dynamical VAE

The dynamical variational autoencoders (DVAEs) are a family of latent-va...

0 Xiaoyu Lin, et al. ∙

research

∙ 07/04/2022

BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model

Several recent studies have tested the use of transformer language model...

3 Brooke Stephenson, et al. ∙

research

∙ 04/14/2022

Learning and controlling the source-filter representation of speech with a variational autoencoder

Understanding and controlling latent representations in deep generative ...

0 Samir Sadok, et al. ∙

research

∙ 04/05/2022

Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation

We propose a computational model of speech production combining a pre-tr...

0 Marc-Antoine Georges, et al. ∙

research

∙ 02/18/2022

Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder

In this paper, we present an unsupervised probabilistic model and associ...

0 Xiaoyu Lin, et al. ∙

research

∙ 09/08/2021

A Survey of Sound Source Localization with Deep Learning Methods

This article is a survey on deep learning methods for single and multipl...

0 Pierre-Amaury Grumiaux, et al. ∙

research

∙ 07/23/2021

SALADnet: Self-Attentive multisource Localization in the Ambisonics Domain

In this work, we propose a novel self-attention based neural network for...

0 Pierre-Amaury Grumiaux, et al. ∙

research

∙ 06/23/2021

Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders

Dynamical variational auto-encoders (DVAEs) are a class of deep generati...

0 Xiaoyu Bie, et al. ∙

research

∙ 06/11/2021

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

The Variational Autoencoder (VAE) is a powerful deep generative model th...

0 Xiaoyu Bie, et al. ∙

research

∙ 05/05/2021

Improved feature extraction for CRNN-based multiple sound source localization

In this work, we propose to extend a state-of-the-art multi-source local...

0 Pierre-Amaury Grumiaux, et al. ∙

research

∙ 04/07/2021

Learning robust speech representation with an articulatory-regularized variational autoencoder

It is increasingly considered that human speech perception and productio...

0 Marc-Antoine Georges, et al. ∙

research

∙ 02/19/2021

Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input

The prosody of a spoken word is determined by its surrounding context. I...

0 Brooke Stephenson, et al. ∙

research

∙ 01/06/2021

Multichannel CRNN for Speaker Counting: an Analysis of Performance

Speaker counting is the task of estimating the number of people that are...

0 Pierre-Amaury Grumiaux, et al. ∙

research

∙ 12/07/2020

Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function

This paper addresses the problem of sound-source localization (SSL) with...

0 Xiaofei Li, et al. ∙

research

∙ 09/04/2020

What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS

In incremental text to speech synthesis (iTTS), the synthesizer produces...

0 Brooke Stephenson, et al. ∙

research

∙ 08/28/2020

Dynamical Variational Autoencoders: A Comprehensive Review

The Variational Autoencoder (VAE) is a powerful deep generative model th...

0 Laurent Girin, et al. ∙

research

∙ 03/17/2020

High-Resolution Speaker Counting In Reverberant Rooms Using CRNN With Ambisonics Features

Speaker counting is the task of estimating the number of people that are...

0 Pierre-Amaury Grumiaux, et al. ∙

research

∙ 10/24/2019

A Recurrent Variational Autoencoder for Speech Enhancement

This paper presents a generative approach to speech enhancement based on...

0 Simon Leglaive, et al. ∙

research

∙ 08/07/2019

Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoder

Variational auto-encoders (VAEs) are deep generative latent variable mod...

0 Mostafa Sadeghi, et al. ∙

research

∙ 04/10/2019

Expectation-Maximization for Speech Source Separation Using Convolutive Transfer Function

This paper addresses the problem of under-determined speech source sepa...

0 Xiaofei Li, et al. ∙

research

∙ 04/10/2019

Audio-noise Power Spectral Density Estimation Using Long Short-term Memory

We propose a method using a long short-term memory (LSTM) network to est...

0 Xiaofei Li, et al. ∙

research

∙ 02/08/2019

Speech enhancement with variational autoencoders and alpha-stable distributions

This paper focuses on single-channel semi-supervised speech enhancement....

0 Simon Leglaive, et al. ∙

research

∙ 02/05/2019

A variance modeling framework based on variational autoencoders for speech enhancement

In this paper we address the problem of enhancing speech signals in nois...

0 Simon Leglaive, et al. ∙

research

∙ 12/20/2018

Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering

This paper addresses the problem of multichannel online dereverberation....

0 Xiaofei Li, et al. ∙

research

∙ 12/11/2018

A cascaded multiple-speaker localization and tracking system

This paper presents an online multiple-speaker localization and tracking...

0 Xiaofei Li, et al. ∙

research

∙ 11/16/2018

Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization

In this paper we address speaker-independent multichannel speech enhance...

0 Simon Leglaive, et al. ∙

research

∙ 09/28/2018

Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers

In this paper we address the problem of tracking multiple speakers via t...

14 Yutong Ban, et al. ∙

research

∙ 09/28/2018

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments

This paper addresses the problem of online multiple-speaker localization...

0 Xiaofei Li, et al. ∙

research

∙ 06/11/2018

Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models

This study investigates the use of non-linear unsupervised dimensionalit...

0 Fanny Roche, et al. ∙

research

∙ 11/21/2017

Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function

This paper addresses the problem of speech separation and enhancement fr...

0 Xiaofei Li, et al. ∙

research

∙ 11/21/2017

Multichannel Source Separation and Speech Enhancement Using the Convolutive Transfer Function

This paper addresses the problem of audio source recovery from multichan...

0 Xiaofei Li, et al. ∙
