Paden Tomasello

research

∙ 08/22/2023

SeamlessM4T-Massively Multilingual Multimodal Machine Translation

What does it take to create the Babel Fish, a tool that can help individ...

Seamless Communication, et al.

research

∙ 05/22/2023

Scaling Speech Technology to 1,000+ Languages

Expanding the language coverage of speech technology has the potential t...

Vineel Pratap, et al.

research

∙ 12/02/2022

Continual Learning for On-Device Speech Recognition using Disentangled Conformers

Automatic speech recognition research focuses on training and evaluating...

Anuj Diwan, et al.

research

∙ 11/11/2022

Speech-to-Speech Translation For A Real-world Unwritten Language

We study speech-to-speech translation (S2ST) that translates speech from...

Peng-Jen Chen, et al.

research

∙ 04/04/2022

Deliberation Model for On-Device Spoken Language Understanding

We propose a novel deliberation-based approach to end-to-end (E2E) spoke...

Duc Le, et al.

research

∙ 03/30/2022

Generative Spoken Dialogue Language Modeling

We introduce dGSLM, the first "textless" model able to generate audio sa...

Tu Anh Nguyen, et al.

research

∙ 02/15/2022

textless-lib: a Library for Textless Spoken Language Processing

Textless spoken language processing research aims to extend the applicab...

Eugene Kharitonov, et al.

research

∙ 01/29/2022

Flashlight: Enabling Innovation in Tools for Machine Learning

As the computational requirements for machine learning systems and the s...

Jacob Kahn, et al.

research

∙ 10/22/2020

Rethinking Evaluation in ASR: Are Our Models Robust Enough?

Is pushing numbers on a single benchmark valuable in automatic speech re...

Tatiana Likhomanenko, et al.

research

∙ 10/22/2020

Self-training and Pre-training are Complementary for Speech Recognition

Self-training and unsupervised pre-training have emerged as effective ap...

Qiantong Xu, et al.

research

∙ 07/06/2020

Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters

We study training a single acoustic model for multiple languages with th...

Vineel Pratap, et al.

research

∙ 11/17/2018

DSCnet: Replicating Lidar Point Clouds with Deep Sensor Cloning

Convolutional neural networks (CNNs) have become increasingly popular fo...

Paden Tomasello, et al.
