Masato Mimura

research

∙ 03/26/2023

Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder

Time-domain speech enhancement (SE) has recently been intensively invest...

0 Hao Shi, et al. ∙

research

∙ 09/08/2022

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM

Connectionist temporal classification (CTC) -based models are attractive...

0 Hayato Futami, et al. ∙

research

∙ 09/05/2022

Distilling the Knowledge of BERT for CTC-based ASR

Connectionist temporal classification (CTC) -based models are attractive...

0 Hayato Futami, et al. ∙

research

∙ 10/05/2021

ASR Rescoring and Confidence Estimation with ELECTRA

In automatic speech recognition (ASR) rescoring, the hypothesis with the...

0 Hayato Futami, et al. ∙

research

∙ 09/07/2020

On the spectrum and linear programming bound for hypergraphs

The spectrum of a graph is closely related to many graph parameters. In ...

0 Sebastian M. Cioabă, et al. ∙

research

∙ 08/09/2020

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR

Attention-based sequence-to-sequence (seq2seq) models have achieved prom...

0 Hayato Futami, et al. ∙

research

∙ 05/19/2020

Enhancing Monotonic Multihead Attention for Streaming ASR

We investigate a monotonic multihead attention (MMA) by extending hard m...

0 Hirofumi Inaguma, et al. ∙

research

∙ 05/19/2020

Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition

It is important to transcribe and archive speech data of endangered lang...

0 Kohei Matsuura, et al. ∙

research

∙ 05/10/2020

CTC-synchronous Training for Monotonic Attention Model

Monotonic chunkwise attention (MoChA) has been studied for the online st...

0 Hirofumi Inaguma, et al. ∙

research

∙ 02/16/2020

Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language

Ainu is an unwritten language that has been spoken by Ainu people who ar...

0 Kohei Matsuura, et al. ∙

research

∙ 09/22/2019

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR

Acoustic-to-word (A2W) end-to-end automatic speech recognition (ASR) sys...

0 Hirofumi Inaguma, et al. ∙

research

∙ 03/22/2019

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

This paper describes multichannel speech enhancement for improving autom...

10 Kazuki Shimada, et al. ∙

research

∙ 10/31/2017

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

This paper presents a statistical method of single-channel speech enhanc...

0 Yoshiaki Bando, et al. ∙

Masato Mimura

Featured Co-authors

Sign in with Google

Consider DeepAI Pro