Speech representations learned in a self-supervised fashion from massive...
Conformer-based end-to-end models have become ubiquitous these days and ...
End-to-end ASR models trained on large amounts of data tend to be implici...
Language models have been shown to perform better with an increase in sc...
In this work, we define barge-in verification as a supervised learning t...
End-to-end speech recognition models trained using joint Connectionist
T...
Although supervised deep learning has revolutionized speech and audio
pr...
Automatic Speech Recognition (ASR) systems have found their use in numer...
Many of the recent advances in speech separation are primarily aimed at
...
Automatic Speech Recognition (ASR) systems have found their use in numer...
Accurate recognition of slot values such as domain specific words or nam...
Automatic Speech Recognition (ASR) robustness toward slot entities is
c...
Neural Language Models (NLM), when trained and evaluated with context
sp...
Goal-oriented conversational interfaces are designed to accomplish speci...
While there have been several contributions exploring state-of-the-art
t...
We live in a world where 60
languages fluently. Members of these communi...
Non-autoregressive models greatly improve decoding speed over typical
se...
In this work, we explore a multimodal semi-supervised learning approach ...
Automatic speech recognition (ASR) systems in the medical domain that fo...
Automatic speech recognition (ASR) systems in the medical domain that fo...
We propose a novel approach to semi-supervised automatic speech recognit...
We rerank with scores from pretrained masked language models like BERT t...
Pretrained contextual word representations in NLP have greatly improved
...
In real-time dialogue systems running at scale, there is a tradeoff betw...
Environmental sound classification systems often do not perform robustly...
Self-attention has demonstrated great success in sequence-to-sequence ta...
Out-of-vocabulary word translation is a major problem for the translatio...
This paper presents our latest investigations on different features for
...
Statistical machine translation for dialectal Arabic is characterized by...