Kazuhito Koishida

research

∙ 09/19/2023

Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

Diffusion models power a vast majority of text-to-audio (TTA) generation...

0 Yatong Bai, et al. ∙

research

∙ 02/20/2023

Progressive Knowledge Distillation: Building Ensembles for Efficient Inference

We study the problem of progressive distillation: Given a large, pre-tra...

0 Don Kurian Dennis, et al. ∙

research

∙ 10/26/2022

SCP-GAN: Self-Correcting Discriminator Optimization for Training Consistency Preserving Metric GAN on Speech Enhancement Tasks

In recent years, Generative Adversarial Networks (GANs) have produced si...

0 Vasily Zadorozhnyy, et al. ∙

research

∙ 12/21/2021

Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations

Improving generalization is a major challenge in audio classification du...

0 Melikasadat Emami, et al. ∙

research

∙ 12/09/2021

A Training Framework for Stereo-Aware Speech Enhancement using Deep Neural Networks

Deep learning-based speech enhancement has shown unprecedented performan...

3 Bahareh Tolooshams, et al. ∙

research

∙ 12/08/2021

Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features

Unsupervised Zero-Shot Voice Conversion (VC) aims to modify the speaker ...

0 Trung Dang, et al. ∙

research

∙ 01/06/2021

Interspeech 2021 Deep Noise Suppression Challenge

The Deep Noise Suppression (DNS) challenge is designed to foster innovat...

5 Chandan K A Reddy, et al. ∙

research

∙ 06/20/2020

Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

Visual reasoning tasks such as visual question answering (VQA) require a...

0 Saeed Amizadeh, et al. ∙

research

∙ 11/20/2019

MMTM: Multimodal Transfer Module for CNN Fusion

In late fusion, each modality is processed in a separate unimodal Convol...

0 Hamid Reza Vaezi Joze, et al. ∙

research

∙ 08/04/2019

Sound Event Detection in Multichannel Audio using Convolutional Time-Frequency-Channel Squeeze and Excitation

In this study, we introduce a convolutional time-frequency-channel "Sque...

0 Wei Xia, et al. ∙

Kazuhito Koishida

Featured Co-authors

Sign in with Google

Consider DeepAI Pro