Barry-John Theobald

research

∙ 09/07/2023

REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation

Fully-test-time adaptation (F-TTA) can mitigate performance loss due to ...

0 Skyler Seto, et al. ∙

research

∙ 08/18/2023

Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning

We present Spatial LibriSpeech, a spatial audio dataset with over 650 ho...

0 Miguel Sarabia, et al. ∙

research

∙ 11/12/2022

Rewards Encoding Environment Dynamics Improves Preference-based Reinforcement Learning

Preference-based reinforcement learning (RL) algorithms help avoid the p...

0 Katherine Metcalf, et al. ∙

research

∙ 11/10/2022

Contrastive Self-Supervised Learning for Skeleton Representations

Human skeleton point clouds are commonly used to automatically classify ...

0 Nico Lingg, et al. ∙

research

∙ 10/26/2022

Naturalistic Head Motion Generation from Speech

Synthesizing natural head motion to accompany speech for an embodied con...

0 Trisha Mittal, et al. ∙

research

∙ 03/18/2022

Towards a Perceptual Model for Estimating the Quality of Visual Speech

Generating realistic lip motions to simulate speech production is key fo...

0 Zakaria Aldeneh, et al. ∙

research

∙ 02/12/2021

Multimodal Punctuation Prediction with Contextual Dropout

Automatic speech recognition (ASR) is widely used in consumer electronic...

0 Andrew Silva, et al. ∙

research

∙ 12/09/2020

MorphGAN: One-Shot Face Synthesis GAN for Detecting Recognition Bias

To detect bias in face recognition networks, it can be useful to probe a...

0 Nataniel Ruiz, et al. ∙

research

∙ 05/27/2020

Modality Dropout for Improved Performance-driven Talking Faces

We describe our novel deep learning approach for driving animated faces ...

0 Ahmed Hussen Abdelaziz, et al. ∙

research

∙ 04/25/2020

Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement

We present an introspection of an audiovisual speech enhancement model. ...

0 Zakaria Aldeneh, et al. ∙

research

∙ 05/15/2019

Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models

Speech-driven visual speech synthesis involves mapping features extracte...

0 Ahmed Hussen Abdelaziz, et al. ∙

research

∙ 04/02/2019

Mirroring to Build Trust in Digital Assistants

We describe experiments towards building a conversational digital assist...

0 Katherine Metcalf, et al. ∙

research

∙ 12/10/2018

Learning Sharing Behaviors with Arbitrary Numbers of Agents

We propose a method for modeling and learning turn-taking behaviors for ...

0 Katherine Metcalf, et al. ∙

research

∙ 10/03/2017

Which phoneme-to-viseme maps best improve visual-only computer lip-reading?

A critical assumption of all current visual speech recognition systems i...

0 Helen L Bear, et al. ∙

research

∙ 10/03/2017

Some observations on computer lip-reading: moving from the dream to the reality

In the quest for greater computer lip-reading performance there are a nu...

0 Helen L Bear, et al. ∙

research

∙ 10/03/2017

Resolution limits on visual speech recognition

Visual-only speech recognition is dependent upon a number of factors tha...

0 Helen L Bear, et al. ∙

Barry-John Theobald

Featured Co-authors

Sign in with Google

Consider DeepAI Pro