We study a streamable attention-based encoder-decoder model in which eit...
We introduce a novel segmental-attention model for automatic speech
reco...
The peaky behavior of CTC models is well known experimentally. However, ...
With the advent of direct models in automatic speech recognition (ASR), ...
Attention-based encoder-decoder (AED) models learn an implicit internal
...
We present our transducer model on Librispeech. We study variants to inc...
End-to-end models reach state-of-the-art performance for speech recognit...
Common end-to-end models like CTC or encoder-decoder-attention models us...
The RNN transducer is a promising end-to-end model candidate. We compare...
Recent advances in text-to-speech (TTS) led to the development of flexib...
Attention-based sequence-to-sequence models have shown promising results...
This work investigates a simple data augmentation technique, SpecAugment...
We explore multi-layer autoregressive Transformer models in language mod...
We present state-of-the-art automatic speech recognition (ASR) systems
e...
We compare the fast training and decoding speed of RETURNN of attention
...
Sequence-to-sequence attention-based models on subword units allow simpl...
In this work we release our extensible and easily configurable neural ne...
We present a comprehensive study of deep bidirectional long short-term m...