Vamsi Krishna Ithapu

research

∙ 03/28/2023

Egocentric Auditory Attention Localization in Conversations

In a noisy conversation environment such as a dinner party, people often...

11 Fiona Ryan, et al. ∙

research

∙ 01/20/2023

Novel-View Acoustic Synthesis

We introduce the novel-view acoustic synthesis (NVAS) task: given the si...

0 Changan Chen, et al. ∙

research

∙ 01/04/2023

Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations

Can conversational videos captured from multiple egocentric viewpoints r...

0 Sagnik Majumder, et al. ∙

research

∙ 11/20/2022

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Audio-visual speech enhancement aims to extract clean speech from a nois...

0 Rodrigo Mira, et al. ∙

research

∙ 11/16/2022

Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-channel Speech Enhancement

Most speech enhancement (SE) models learn a point estimate, and do not m...

0 Kuan-Lin Chen, et al. ∙

research

∙ 11/08/2022

Towards Improved Room Impulse Response Estimation for Speech Recognition

We propose to characterize and improve the performance of blind room imp...

0 Anton Ratnarajah, et al. ∙

research

∙ 02/17/2022

RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing

We present RemixIT, a simple yet effective self-supervised method for tr...

8 Efthymios Tzinis, et al. ∙

research

∙ 02/07/2022

Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks

Impulse response estimation in high noise and in-the-wild settings, with...

0 Alexander Richard, et al. ∙

research

∙ 01/06/2022

Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization

Augmented reality devices have the potential to enhance human perception...

0 Hao Jiang, et al. ∙

research

∙ 07/15/2021

Filtered Noise Shaping for Time Domain Room Impulse Response Estimation From Reverberant Speech

Deep learning approaches have emerged that aim to transform an audio sig...

0 Christian J. Steinmetz, et al. ∙

research

∙ 07/09/2021

EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments

Augmented Reality (AR) as a platform has the potential to facilitate the...

16 Jacob Donley, et al. ∙

research

∙ 06/21/2021

Do sound event representations generalize to other audio tasks? A case study in audio transfer learning

Transfer learning is critical for efficient information transfer across ...

0 Anurag Kumar, et al. ∙

research

∙ 04/12/2021

Egocentric Pose Estimation from Human Vision Span

Estimating camera wearer's body pose from an egocentric view (egopose) i...

0 Hao Jiang, et al. ∙

research

∙ 06/30/2020

A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition

An important problem in machine auditory perception is to recognize and ...

0 Anurag Kumar, et al. ∙

research

∙ 12/24/2019

Audio-Visual Embodied Navigation

Moving around in the world is naturally a multisensory experience, but t...

15 Changan Chen, et al. ∙

research

∙ 10/25/2019

SeCoST: Sequential Co-Supervision for Weakly Labeled Audio Event Detection

Weakly supervised learning algorithms are critical for scaling audio eve...

0 Anurag Kumar, et al. ∙

Vamsi Krishna Ithapu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro