Cal Peyser

research

∙ 08/11/2023

Improving Joint Speech-Text Representations Without Alignment

The last year has seen astonishing progress in text-prompted image gener...

0 Cal Peyser, et al. ∙

research

∙ 04/19/2023

A Comparison of Semi-Supervised Learning Techniques for Streaming ASR at Scale

Unpaired text and audio injection have emerged as dominant methods for i...

3 Cal Peyser, et al. ∙

research

∙ 01/11/2023

Dual Learning for Large Vocabulary On-Device ASR

Dual learning is a paradigm for semi-supervised machine learning that se...

1 Cal Peyser, et al. ∙

research

∙ 11/28/2022

E2E Segmentation in a Two-Pass Cascaded Encoder ASR Model

We explore unifying a neural segmenter with two-pass cascaded encoder AS...

0 W. Ronny Huang, et al. ∙

research

∙ 08/28/2022

Towards Disentangled Speech Representations

The careful construction of audio representations has become a dominant ...

4 Cal Peyser, et al. ∙

research

∙ 04/22/2022

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

Improving the performance of end-to-end ASR models on long utterances ra...

0 W. Ronny Huang, et al. ∙

research

∙ 04/15/2022

Improving Rare Word Recognition with LM-aware MWER Training

Language models (LMs) significantly improve the recognition accuracy of ...

0 Weiran Wang, et al. ∙

research

∙ 03/09/2022

Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition

Language model fusion helps smart assistants recognize words which are r...

5 W. Ronny Huang, et al. ∙

research

∙ 04/09/2021

Lookup-Table Recurrent Language Models for Long Tail Speech Recognition

We introduce Lookup-Table Language Models (LookupLM), a method for scali...

0 W. Ronny Huang, et al. ∙

research

∙ 08/24/2020

Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus

End-to-end (E2E) automatic speech recognition (ASR) systems lack the dis...

0 Cal Peyser, et al. ∙

research

∙ 05/19/2020

Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion

Proper nouns present a challenge for end-to-end (E2E) automatic speech r...

0 Cal Peyser, et al. ∙

research

∙ 03/28/2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Thus far, end-to-end (E2E) models have not been shown to outperform stat...

0 Tara N. Sainath, et al. ∙

research

∙ 07/01/2019

Improving Performance of End-to-End ASR on Numeric Sequences

Recognizing written domain numeric utterances (e.g. I need 1.25.) can be...

2 Cal Peyser, et al. ∙

Cal Peyser

Featured Co-authors

Sign in with Google

Consider DeepAI Pro