In this paper we present VDTTS, a Visually-Driven Text-to-Speech model.
...
This work describes an interactive decoding method to improve the perfor...
We describe a system for large-scale audiovisual translation and dubbing...
This work presents a large-scale audio-visual speech recognition system ...
Large-scale mobile communication systems tend to contain legacy transmis...
We present a meta-learning approach for adaptive text-to-speech (TTS) wi...
This work presents a scalable solution to open-vocabulary visual speech
...
Cortical circuits exhibit intricate recurrent architectures that are
rem...
Lipreading is the task of decoding text from the movement of a speaker's...
The move from hand-designed features to learned features in machine lear...