Brendan Shillingford

research

∙ 11/19/2021

More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech

In this paper we present VDTTS, a Visually-Driven Text-to-Speech model. ...

0 Michael Hassid, et al. ∙

research

∙ 07/01/2021

Interactive decoding of words from visual speech recognition models

This work describes an interactive decoding method to improve the perfor...

3 Brendan Shillingford, et al. ∙

research

∙ 11/06/2020

Large-scale multilingual audio visual dubbing

We describe a system for large-scale audiovisual translation and dubbing...

3 Yi Yang, et al. ∙

research

∙ 11/08/2019

Recurrent Neural Network Transducer for Audio-Visual Speech Recognition

This work presents a large-scale audio-visual speech recognition system ...

0 Takaki Makino, et al. ∙

research

∙ 07/05/2019

Speech bandwidth extension with WaveNet

Large-scale mobile communication systems tend to contain legacy transmis...

0 Archit Gupta, et al. ∙

research

∙ 09/27/2018

Sample Efficient Adaptive Text-to-Speech

We present a meta-learning approach for adaptive text-to-speech (TTS) wi...

2 Yutian Chen, et al. ∙

research

∙ 07/13/2018

Large-Scale Visual Speech Recognition

This work presents a scalable solution to open-vocabulary visual speech ...

68 Brendan Shillingford, et al. ∙

research

∙ 11/07/2017

Cortical microcircuits as gated-recurrent neural networks

Cortical circuits exhibit intricate recurrent architectures that are rem...

0 Rui Ponte Costa, et al. ∙

research

∙ 11/05/2016

LipNet: End-to-End Sentence-level Lipreading

Lipreading is the task of decoding text from the movement of a speaker's...

0 Yannis Assael, et al. ∙

research

∙ 06/14/2016

Learning to learn by gradient descent by gradient descent

The move from hand-designed features to learned features in machine lear...

0 Marcin Andrychowicz, et al. ∙

Brendan Shillingford

Featured Co-authors

Sign in with Google

Consider DeepAI Pro