Sharath Adavanne

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Hiroshi Saruwatari
76 publications
Tuomas Virtanen
68 publications
Yuki Mitsufuji
56 publications
Shinnosuke Takamichi
50 publications
Konstantinos Drossos
37 publications
Naoya Takahashi
29 publications
Shusuke Takahashi
23 publications
Antoine Deleforge
21 publications
Archontis Politis
20 publications
Yuichiro Koyama
17 publications
Kazuki Shimada
15 publications

research

∙ 06/15/2023

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events

While direction of arrival (DOA) of sound events is generally estimated ...

5 Kazuki Shimada, et al. ∙

research

∙ 11/04/2022

Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts

We present a multi-speaker Japanese audiobook text-to-speech (TTS) syste...

0 Detai Xin, et al. ∙

research

∙ 06/04/2022

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

This report presents the Sony-TAu Realistic Spatial Soundscapes 2022 (ST...

0 Archontis Politis, et al. ∙

research

∙ 10/29/2021

Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers

Data-based and learning-based sound source localization (SSL) has shown ...

0 Sharath Adavanne, et al. ∙

research

∙ 06/21/2021

Non-native English lexicon creation for bilingual speech synthesis

Bilingual English speakers speak English as one of their languages. Thei...

0 Arun Baby, et al. ∙

research

∙ 06/13/2021

A Dataset of Dynamic Reverberant Sound Scenes with Directional Interferers for Sound Event Localization and Detection

This report presents the dataset and baseline of Task 3 of the DCASE2021...

0 Archontis Politis, et al. ∙

research

∙ 09/06/2020

Overview and Evaluation of Sound Event Localization and Detection in DCASE 2019

Sound event localization and detection is a novel area of research that ...

0 Archontis Politis, et al. ∙

research

∙ 06/02/2020

A Dataset of Reverberant Spatial Sound Scenes with Moving Sources for Sound Event Localization and Detection

This report presents the dataset and the evaluation setup of the Sound E...

0 Archontis Politis, et al. ∙

research

∙ 06/02/2020

An ASR Guided Speech Intelligibility Measure for TTS Model Selection

The perceptual quality of neural text-to-speech (TTS) is highly dependen...

0 Arun Baby, et al. ∙

research

∙ 05/21/2019

A multi-room reverberant dataset for sound event localization and detection

This paper presents the sound event localization and detection (SELD) ta...

0 Sharath Adavanne, et al. ∙

research

∙ 04/29/2019

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network

This paper investigates the joint localization, detection, and tracking ...

0 Sharath Adavanne, et al. ∙

research

∙ 06/30/2018

Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks

In this paper, we propose a convolutional recurrent neural network for j...

0 Sharath Adavanne, et al. ∙

research

∙ 01/29/2018

Multichannel Sound Event Detection Using 3D Convolutional Neural Networks for Learning Inter-channel Features

In this paper, we propose a stacked convolutional and recurrent neural n...

0 Sharath Adavanne, et al. ∙

research

∙ 10/27/2017

Direction of arrival estimation for multiple sound sources using convolutional recurrent neural network

This paper proposes a deep neural network for estimating the directions ...

0 Sharath Adavanne, et al. ∙

research

∙ 10/09/2017

Sound event detection using weakly labeled dataset with stacked convolutional and recurrent neural network

This paper proposes a neural network architecture and training scheme to...

0 Sharath Adavanne, et al. ∙

research

∙ 10/09/2017

A report on sound event detection with different binaural features

In this paper, we compare the performance of using binaural audio featur...

0 Sharath Adavanne, et al. ∙

research

∙ 06/30/2017

Automated Audio Captioning with Recurrent Neural Networks

We present the first approach to automated audio captioning. We employ a...

0 Konstantinos Drossos, et al. ∙

research

∙ 03/07/2017

Convolutional Recurrent Neural Networks for Bird Audio Detection

Bird sounds possess distinctive spectral structure which may exhibit sma...

0 EmreÇakır, et al. ∙

Success!

An error occurred

Sharath Adavanne

Featured Co-authors

Sign in with Google

Consider DeepAI Pro