This work proposes an end-to-end approach to estimate full 3D hand pose ...
Video action detection (spatio-temporal action localization) is usually ...
Temporal Activity Detection aims to predict activity classes per frame, ...
Multi-person pose estimation and tracking serve as crucial steps for vid...
Object detection is an essential step towards holistic scene understandi...
Many real-world applications, such as city-scale traffic monitoring and
...
Attention mechanism has been shown to be effective for person
re-identif...
Dense video captioning is an extremely challenging task since accurate a...
This work addresses a novel and challenging problem of estimating the fu...
In this paper, we propose a novel object detection algorithm named "Deep...
Recently, there emerged revived interests of designing automatic program...
To accelerate research on adversarial examples and robustness of machine...
Though convolutional neural networks have achieved state-of-the-art
perf...
A key challenge in generic object detection is being to handle large
var...
Convolutional neural networks have demonstrated their powerful ability o...
Image captioning is a challenging problem owing to the complexity in
und...
Visual-semantic embedding models have been recently proposed and shown t...