Jonas Beskow

research

∙ 09/11/2023

Diffusion-Based Co-Speech Gesture Generation Using Joint Text and Audio Representation

This paper describes a system developed for the GENEA (Generation and Ev...

0 Anna Deichler, et al. ∙

research

∙ 09/06/2023

Matcha-TTS: A fast TTS architecture with conditional flow matching

We introduce Matcha-TTS, a new encoder-decoder architecture for speedy T...

0 Shivam Mehta, et al. ∙

research

∙ 06/15/2023

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

With read-aloud speech synthesis achieving high naturalness scores, ther...

0 Shivam Mehta, et al. ∙

research

∙ 11/17/2022

Listen, denoise, action! Audio-driven motion synthesis with diffusion models

Diffusion models have experienced a surge of interest as highly expressi...

0 Simon Alexanderson, et al. ∙

research

∙ 11/13/2022

OverFlow: Putting flows on top of neural transducers for better TTS

Neural HMMs are a type of neural transducer recently proposed for sequen...

0 Shivam Mehta, et al. ∙

research

∙ 09/02/2021

Mechanical Chameleons: Evaluating the effects of a social robot's non-verbal behavior on social influence

In this paper we present a pilot study which investigates how non-verbal...

0 Patrik Jonell, et al. ∙

research

∙ 08/30/2021

Neural HMMs are all you need (for high-quality attention-free TTS)

Neural sequence-to-sequence TTS has achieved significantly better output...

0 Shivam Mehta, et al. ∙

research

∙ 08/25/2021

Integrated Speech and Gesture Synthesis

Text-to-speech and co-speech gesture synthesis have until now been treat...

0 Siyang Wang, et al. ∙

research

∙ 06/25/2021

Transflower: probabilistic autoregressive dance generation with multimodal attention

Dance requires skillful composition of complex movements that follow rhy...

2 Guillermo Valle Pérez, et al. ∙

research

∙ 01/14/2021

Generating coherent spontaneous speech and gesture from text

Embodied human communication encompasses both verbal (speech) and non-ve...

0 Simon Alexanderson, et al. ∙

research

∙ 09/22/2020

Can we trust online crowdworkers? Comparing online and offline participants in a preference test of virtual agents

Conducting user studies is a crucial component in many scientific fields...

0 Patrik Jonell, et al. ∙

research

∙ 06/11/2020

Let's face it: Probabilistic multi-modal interlocutor-aware generation of facial gestures in dyadic settings

To enable more natural face-to-face interactions, conversational agents ...

0 Patrik Jonell, et al. ∙

research

∙ 05/16/2019

MoGlow: Probabilistic and controllable motion synthesis using normalising flows

Data-driven modelling and synthesis of motion data is an active research...

0 Gustav Eje Henter, et al. ∙

research

∙ 01/30/2019

The effect of a physical robot on vocabulary learning

This study investigates the effect of a physical robot taking the role o...

0 Andreas Wedenborn, et al. ∙

research

∙ 11/24/2017

Self-Supervised Vision-Based Detection of the Active Speaker as a Prerequisite for Socially-Aware Language Acquisition

This paper presents a self-supervised method for detecting the active sp...

0 Kalin Stefanov, et al. ∙

research

∙ 09/05/2017

Machine Learning and Social Robotics for Detecting Early Signs of Dementia

This paper presents the EACare project, an ambitious multi-disciplinary ...

0 Patrik Jonell, et al. ∙

research

∙ 11/16/2012

Visual Recognition of Isolated Swedish Sign Language Signs

We present a method for recognition of isolated Swedish Sign Language si...

0 Saad Akram, et al. ∙

Jonas Beskow

Featured Co-authors

Sign in with Google

Consider DeepAI Pro