What does it take to create the Babel Fish, a tool that can help individ...
Expanding the language coverage of speech technology has the potential t...
Automatic speech recognition research focuses on training and evaluating...
We study speech-to-speech translation (S2ST) that translates speech from...
We propose a novel deliberation-based approach to end-to-end (E2E) spoke...
We introduce dGSLM, the first "textless" model able to generate audio sa...
Textless spoken language processing research aims to extend the applicab...
As the computational requirements for machine learning systems and the s...
Is pushing numbers on a single benchmark valuable in automatic speech re...
Self-training and unsupervised pre-training have emerged as effective ap...
We study training a single acoustic model for multiple languages with th...
Convolutional neural networks (CNNs) have become increasingly popular fo...