Alexander Waibel

research

∙ 08/22/2023

Convoifilter: A case study of doing cocktail party speech recognition

This paper presents an end-to-end model designed to improve automatic sp...

0 Thai Binh Nguyen, et al. ∙

research

∙ 08/07/2023

End-to-End Evaluation for Low-Latency Simultaneous Speech Translation

The challenge of low-latency speech translation has recently draw signif...

0 Christian Huber, et al. ∙

research

∙ 07/18/2023

Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow

Audio-driven talking face generation is the task of creating a lip-synch...

0 Dogucan Yaman, et al. ∙

research

∙ 06/08/2023

KIT's Multilingual Speech Translation System for IWSLT 2023

Many existing speech translation benchmarks focus on native-English spee...

0 Danni Liu, et al. ∙

research

∙ 11/21/2022

Towards continually learning new languages

Multilingual speech recognition with neural networks is often implemente...

0 Ngoc-Quan Pham, et al. ∙

research

∙ 10/17/2022

Language-agnostic Code-Switching in End-To-End Speech Recognition

Code-Switching (CS) is referred to the phenomenon of alternately using w...

0 Enes Yavuz Ugan, et al. ∙

research

∙ 10/04/2022

Code-Switching without Switching: Language Agnostic End-to-End Speech Translation

We propose a) a Language Agnostic end-to-end Speech Translation model (L...

0 Christian Huber, et al. ∙

research

∙ 06/09/2022

Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos

In this paper, we propose a neural end-to-end system for voice preservin...

15 Alexander Waibel, et al. ∙

research

∙ 04/22/2022

Exposure Correction Model to Enhance Image Quality

Exposure errors in an image cause a degradation in the contrast and low ...

0 Fevziye Irem Eyiokur, et al. ∙

research

∙ 04/12/2022

CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022

In this paper, we describe our submission to the Simultaneous Speech Tra...

0 Peter Polák, et al. ∙

research

∙ 03/29/2022

Short-Term Word-Learning in a Dynamically Changing Environment

Neural sequence-to-sequence automatic speech recognition (ASR) systems a...

0 Christian Huber, et al. ∙

research

∙ 07/05/2021

Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition

Neural sequence-to-sequence systems deliver state-of-the-art performance...

0 Christian Huber, et al. ∙

research

∙ 06/06/2021

Alpha Matte Generation from Single Input for Portrait Matting

Portrait matting is an important research problem with a wide range of a...

0 Dogucan Yaman, et al. ∙

research

∙ 05/07/2021

Efficient Weight factorization for Multilingual Speech Recognition

End-to-end multilingual speech recognition involves using a single model...

0 Ngoc-Quan Pham, et al. ∙

research

∙ 04/26/2021

CAGAN: Text-To-Image Generation with Combined Attention GANs

Generating images according to natural language descriptions is a challe...

0 Henning Schulze, et al. ∙

research

∙ 03/16/2021

A Computer Vision System to Help Prevent the Transmission of COVID-19

The COVID-19 pandemic affects every area of daily life globally. To avoi...

24 Fevziye Irem Eyiokur, et al. ∙

research

∙ 03/11/2021

Unsupervised Transfer Learning in Multilingual Neural Machine Translation with Cross-Lingual Word Embeddings

In this work we look into adding a new language to a multilingual NMT sy...

0 Carlos Mullov, et al. ∙

research

∙ 05/20/2020

Relative Positional Encoding for Speech Recognition and Direct Translation

Transformer models are powerful sequence-to-sequence architectures that ...

0 Ngoc-Quan Pham, et al. ∙

research

∙ 01/29/2020

Gun Source and Muzzle Head Detection

There is a surging need across the world for protection against gun viol...

16 Zhong Zhou, et al. ∙

research

∙ 11/21/2017

Effective Strategies in Zero-Shot Neural Machine Translation

In this paper, we proposed two strategies which can be applied to a mult...

0 Thanh-Le Ha, et al. ∙

research

∙ 11/15/2016

Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder

In this paper, we present our first attempts in building a multilingual ...

0 Thanh-Le Ha, et al. ∙

Alexander Waibel

Featured Co-authors

Sign in with Google

Consider DeepAI Pro