Weakly-supervised temporal action localization aims to locate action reg...
In this paper, we propose the semantic graph Transformer (SGT) for the 3...
Predicting attention regions of interest is an important yet challenging...
Video denoising aims to recover high-quality frames from the noisy video...
The ever-increasing heavy traffic congestion potentially impedes the
acc...
For a new city that is committed to promoting Electric Vehicles (EVs), i...
We address the challenging task of cross-modal moment retrieval, which a...
Deep neural networks have achieved remarkable success for video-based ac...
Recent studies on automatic neural architectures search have demonstrate...
Discovering social relations in images can make machines better interpre...
This paper is focused on the task of searching for a specific vehicle th...
Incentive mechanism design has aroused extensive attention for crowdsour...
To reduce the large computation and storage cost of a deep convolutional...
Action recognition in surveillance video makes our life safer by detecti...