Large Language Models (LLMs) demonstrate impressive capabilities, yet
in...
Beam search, which is the dominant ASR decoding algorithm for end-to-end...
Sequence-to-Sequence Text-to-Speech architectures that directly generate...
When recurrent neural network transducers (RNNTs) are trained using the
...