Contrastive learning-based vision-language pre-training approaches, such...
The attention-based Transformers have been increasingly applied to audio...
This work studies discrete diffusion probabilistic models with applicati...
Despite the success of fully-supervised human skeleton sequence modeling...
Attention mechanisms have been widely applied to cross-modal tasks such ...
Generating radiology reports is time-consuming and requires extensive
ex...
We propose a Dynamic Graph-Based Spatial-Temporal Attention (DG-STA) met...
Sentiment analysis on large-scale social media data is important to brid...