Dialogue act classification (DAC) is a critical task for spoken language...
Due to the availability of multi-modal remote sensing (RS) image archive...
End-to-end (E2E) spoken language understanding (SLU) systems can infer t...
Multilingual ASR technology simplifies model training and deployment, bu...
User studies have shown that reducing the latency of our simultaneous le...
Recently, end-to-end sequence-to-sequence models for speech recognition ...
Multilingual Speech Recognition is one of the most costly AI problems, b...
A large amount of data is required for automatic speech recognition (ASR...
Training automatic speech recognition (ASR) systems requires large amoun...
Using supporting backchannel (BC) cues can make human-computer interacti...