Niko Moritz

research

∙ 09/20/2023

Directional Source Separation for Robust Speech Recognition on Smart Glasses

Modern smart glasses leverage advanced audio sensing and machine learnin...

0 Tiantian Feng, et al. ∙

research

∙ 03/30/2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Recently reported state-of-the-art results in visual speech recognition ...

0 Xubo Liu, et al. ∙

research

∙ 11/03/2022

Streaming Audio-Visual Speech Recognition with Alignment Regularization

Recognizing a word shortly after it is spoken is an important requiremen...

0 Pingchuan Ma, et al. ∙

research

∙ 10/20/2022

Anchored Speech Recognition with Neural Transducers

Neural transducers have gained popularity in production ASR systems, ach...

0 Desh Raj, et al. ∙

research

∙ 04/19/2022

An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition

The two most popular loss functions for streaming end-to-end automatic s...

0 Niko Moritz, et al. ∙

research

∙ 03/01/2022

Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR

Graph-based temporal classification (GTC), a generalized form of the con...

0 Xuankai Chang, et al. ∙

research

∙ 11/01/2021

Sequence Transduction with Graph-based Supervision

The recurrent neural network transducer (RNN-T) objective plays a major ...

0 Niko Moritz, et al. ∙

research

∙ 10/11/2021

Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy

Pseudo-labeling (PL), a semi-supervised learning (SSL) method where a se...

0 Yosuke Higuchi, et al. ∙

research

∙ 07/02/2021

Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition

Attention-based end-to-end automatic speech recognition (ASR) systems ha...

0 Niko Moritz, et al. ∙

research

∙ 06/16/2021

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

Pseudo-labeling (PL) has been shown to be effective in semi-supervised a...

0 Yosuke Higuchi, et al. ∙

research

∙ 04/19/2021

Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers

This paper addresses end-to-end automatic speech recognition (ASR) for l...

0 Takaaki Hori, et al. ∙

research

∙ 04/07/2021

Capturing Multi-Resolution Context by Dilated Self-Attention

Self-attention has become an important and widely used neural network co...

0 Niko Moritz, et al. ∙

research

∙ 11/26/2020

Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training

The performance of automatic speech recognition (ASR) systems typically ...

0 Sameer Khurana, et al. ∙

research

∙ 10/29/2020

Semi-Supervised Speech Recognition via Graph-based Temporal Classification

Semi-supervised learning has demonstrated promising results in automatic...

0 Niko Moritz, et al. ∙

research

∙ 02/14/2020

Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR

We propose an unsupervised speaker adaptation method inspired by the neu...

0 Leda Sarı, et al. ∙

research

∙ 01/08/2020

Streaming automatic speech recognition with the transformer model

Encoder-decoder based sequence-to-sequence models have demonstrated stat...

0 Niko Moritz, et al. ∙

Niko Moritz

Featured Co-authors

Sign in with Google

Consider DeepAI Pro