We propose ImGeoNet, a multi-view image-based 3D object detection framew...
This paper describes the ESPnet Unsupervised ASR Open-source Toolkit (EU...
Simultaneous translation (SimulMT) speeds up the translation process by
...
We study the possibilities of building a non-autoregressive speech-to-te...
Mandarin-English code-switching (CS) is frequently used among East and
S...
Speech separation has been well-developed while there are still problems...
Speech translation (ST) aims to learn transformations from speech in the...
A lack of code-switching data complicates the training of code-switching...
In this paper, we investigate the benefit that off-the-shelf word embedd...
Code-switching is about dealing with alternative languages in speech or ...
Humans can imagine a scene from a sound. We want machines to do so by us...
Recurrent neural networks (RNNs) have achieved great success in language...