Many real-time applications (e.g., Augmented/Virtual Reality, cognitive
...
High-quality data labeling from specific domains is costly and human
tim...
Automatic Speech Recognition (ASR) is a key element in new services that...
Jitter and shimmer measurements have shown to be carriers of voice quali...
Nowadays, research in speech technologies has gotten a lot out thanks to...
Keyword spotting and in particular Wake-Up-Word (WUW) detection is a ver...
This paper describes joint effort of BUT and Telefónica Research on
deve...
The effects of adding pitch and voice quality features such as jitter an...
In this work, we propose an effective approach for training unique embed...
The use of photoplethysmogram signal (PPG) for heart and sleep monitorin...
Our interaction with the world is an inherently multimodal experience.
H...
Likelihood-based generative models are a promising resource to detect
ou...
Sounds are an important source of information on our daily interactions ...
Linguistic laws constitute one of the quantitative cornerstones of moder...