Pooling layers are essential building blocks of Convolutional Neural Net...
Few-shot instance segmentation methods are promising when labeled traini...
The variations in the temporal performance of human actions observed in
...
Temporal motion has been one of the essential components for effectively...
Training Deep Convolutional Neural Networks (CNNs) is based on the notio...
Effective processing of video input is essential for the recognition of
...
Deep convolutional networks are widely used in video action recognition....
In this work we employ multitask learning to capitalize on the structure...
CNN-based methods have been proven to work well for saliency detection o...
Egocentric vision is an emerging field of computer vision that is
charac...
Deep learning approaches have been established as the main methodology f...
Language use reveals information about who we are and how we feel1-3. On...
Many videos depict people, and it is their interactions that inform us o...