This paper describes a system developed for the GENEA (Generation and Ev...
We introduce Matcha-TTS, a new encoder-decoder architecture for speedy T...
With read-aloud speech synthesis achieving high naturalness scores, ther...
Diffusion models have experienced a surge of interest as highly expressi...
Neural HMMs are a type of neural transducer recently proposed for sequen...
In this paper we present a pilot study which investigates how non-verbal...
Neural sequence-to-sequence TTS has achieved significantly better output...
Text-to-speech and co-speech gesture synthesis have until now been treat...
Dance requires skillful composition of complex movements that follow rhy...
Embodied human communication encompasses both verbal (speech) and non-ve...
Conducting user studies is a crucial component in many scientific fields...
To enable more natural face-to-face interactions, conversational agents ...
Data-driven modelling and synthesis of motion data is an active research...
This study investigates the effect of a physical robot taking the role o...
This paper presents a self-supervised method for detecting the active sp...
This paper presents the EACare project, an ambitious multi-disciplinary ...
We present a method for recognition of isolated Swedish Sign Language si...