This paper presents an end-to-end model designed to improve automatic sp...
The challenge of low-latency speech translation has recently drawn signif...
Audio-driven talking face generation is the task of creating a lip-synch...
Many existing speech translation benchmarks focus on native-English spee...
Multilingual speech recognition with neural networks is often implemente...
Code-Switching (CS) refers to the phenomenon of alternately using w...
We propose a) a Language Agnostic end-to-end Speech Translation model (L...
In this paper, we propose a neural end-to-end system for voice preservin...
Exposure errors in an image cause a degradation in the contrast and low ...
In this paper, we describe our submission to the Simultaneous Speech Tra...
Neural sequence-to-sequence automatic speech recognition (ASR) systems a...
Neural sequence-to-sequence systems deliver state-of-the-art performance...
Portrait matting is an important research problem with a wide range of a...
End-to-end multilingual speech recognition involves using a single model...
Generating images according to natural language descriptions is a challe...
The COVID-19 pandemic affects every area of daily life globally. To avoi...
In this work we look into adding a new language to a multilingual NMT sy...
Transformer models are powerful sequence-to-sequence architectures that ...
There is a surging need across the world for protection against gun viol...
In this paper, we propose two strategies which can be applied to a mult...
In this paper, we present our first attempts in building a multilingual ...