Hasim Sak

research

∙ 05/27/2022

Contrastive Siamese Network for Semi-supervised Speech Recognition

This paper introduces contrastive siamese (c-siam) network, an architect...

5 Soheil Khorram, et al. ∙

research

∙ 09/23/2021

Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

In this paper, we present a novel speaker diarization system for streami...

0 Wei Xia, et al. ∙

research

∙ 05/06/2021

Reducing Streaming ASR Model Delay with Self Alignment

Reducing prediction delay for streaming end-to-end ASR models with minim...

4 Jaeyoung Kim, et al. ∙

research

∙ 10/07/2020

Transformer Transducer: One Model Unifying Streaming and Non-streaming Speech Recognition

In this paper we present a Transformer-Transducer model architecture and...

0 Anshuman Tripathi, et al. ∙

research

∙ 02/26/2020

A Density Ratio Approach to Language Model Fusion in End-To-End Automatic Speech Recognition

This article describes a density ratio approach to integrating external ...

0 Erik McDermott, et al. ∙

research

∙ 02/07/2020

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss

In this paper we present an end-to-end speech recognition model with Tra...

0 Qian Zhang, et al. ∙

research

∙ 06/17/2019

Adversarial Training for Multilingual Acoustic Modeling

Multilingual training has been shown to improve acoustic modeling perfor...

0 Ke Hu, et al. ∙

research

∙ 07/13/2018

Large-Scale Visual Speech Recognition

This work presents a scalable solution to open-vocabulary visual speech ...

68 Brendan Shillingford, et al. ∙

research

∙ 01/02/2018

Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer

We investigate training end-to-end speech recognition models with the re...

0 Kanishka Rao, et al. ∙

research

∙ 11/20/2017

Speech recognition for medical conversations

In this paper we document our experiences with developing speech recogni...

0 Chung-Cheng Chiu, et al. ∙

research

∙ 10/31/2016

Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition

We present results that show it is possible to build a competitive, grea...

0 Hagen Soltau, et al. ∙

research

∙ 03/10/2016

Personalized Speech recognition on mobile devices

We describe a large vocabulary speech recognition system that is accurat...

0 Ian McGraw, et al. ∙

research

∙ 07/24/2015

Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition

We have recently shown that deep Long Short-Term Memory (LSTM) recurrent...

0 Hasim Sak, et al. ∙

research

∙ 02/05/2014

Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) archit...

0 Hasim Sak, et al. ∙

Hasim Sak

Featured Co-authors

Sign in with Google

Consider DeepAI Pro