Accurate recognition of specific categories, such as persons' names, dat...
Recently, a number of approaches to train speech models by incorpo-ratin...
We introduce the Universal Speech Model (USM), a single large model that...
Text-only adaptation of a transducer model remains challenging for end-t...
This paper proposes Virtuoso, a massively multilingual speech-text joint...
Data augmentation is a ubiquitous technique used to provide robustness t...
Automatic speech recognition (ASR) needs to be robust to speaker differe...
The BARN (Benchmark Autonomous Robot Navigation) Challenge took place at...
Building inclusive speech recognition systems is a crucial step towards
...
Self-supervised pretraining for Automated Speech Recognition (ASR) has s...
Recent trends in neural network based text-to-speech/speech synthesis
pi...
Most data intensive applications often access only a few fields of the
o...