This paper proposes a direct text to speech translation system using dis...
Voice activity and overlapped speech detection (respectively VAD and OSD...
Research in multilingual speech-to-text translation is topical. Having a...
Pseudo-label (PL) filtering forms a crucial part of Self-Training (ST)
m...
This article focuses on overlapped speech and gender detection in order ...
We aim at improving spoken language modeling (LM) using very large amoun...
We propose the SAMU-XLSR: Semantically-Aligned Multimodal Utterance-leve...
This paper describes the ON-TRAC Consortium translation systems develope...
We propose a simple and effective cross-lingual transfer learning method...
Speaker segmentation consists in partitioning a conversation between one...
In this paper, we propose a novel end-to-end sequence-to-sequence spoken...
More than half of the 7,000 languages in the world are in imminent dange...
Probabilistic Latent Variable Models (LVMs) provide an alternative to
se...
In this paper we demonstrate methods for reliable and efficient training...
This work investigates spoken language understanding (SLU) systems in th...
We present an end-to-end approach to extract semantic concepts directly ...
Named entity recognition (NER) is among SLU tasks that usually extract
s...