research
∙
12/14/2020
AV Taris: Online Audio-Visual Speech Recognition
In recent years, Automatic Speech Recognition (ASR) technology has appro...
research
∙
06/08/2020
Learning to Count Words in Fluent Speech enables Online Speech Recognition
Sequence to Sequence models, in particular the Transformer, achieve stat...
research
∙
05/19/2020
Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
The audio-visual speech fusion strategy AV Align has shown significant p...
research
∙
04/17/2020
How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition
Audio-Visual Speech Recognition (AVSR) seeks to model, and thereby explo...
research
∙
09/05/2018
Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition
Automatic speech recognition can potentially benefit from the lip motion...
research
∙
05/29/2018