This paper is the system description of the DKU-MSXF System for the trac...
In this paper, we introduce a large-scale and high-quality audio-visual
...
This paper describes the NPU-MSXF system for the IWSLT 2023 speech-to-sp...
Structural magnetic resonance imaging (sMRI) provides accurate estimates...
Structural magnetic resonance imaging (sMRI) has shown great clinical va...
End-to-end automatic speech recognition (ASR) usually suffers from
perfo...
We investigate the linear stability analysis of a pathway-based diffusio...
In current two-stage neural text-to-speech (TTS) paradigm, it is ideal t...
Multimodal knowledge graph completion (MKGC) aims to predict missing ent...
Despite the great progress of Visual Question Answering (VQA), current V...
Transformer-based models have demonstrated their effectiveness in automa...
The timely sharing of raw sensing information in the vehicular networks
...
Procedural Multimodal Documents (PMDs) organize textual instructions and...
Recently, surface electromyogram (EMG) has been proposed as a novel biom...
Self-supervised representation learning for visual pre-training has achi...
Due to the large success in object detection and instance segmentation, ...
We proposed a novel visual stimulus for brain-computer interface. The
st...