Recent DETR-based video grounding models have made the model directly pr...
Modern data augmentation using a mixture-based technique can regularize ...
In this paper, we efficiently transfer the surpassing representation pow...
Online Temporal Action Localization (On-TAL) aims to immediately provide...
Given an untrimmed video and a language query depicting a specific tempo...
Online stereo adaptation tackles the domain shift problem, caused by
dif...
This paper presents Probabilistic Video Contrastive Learning, a
self-sup...
The rise of deep neural networks has led to several breakthroughs for
se...
Domain generalization aims to learn a prediction model on multi-domain s...
This paper presents a novel method, termed Bridge to Answer, to infer co...
Existing techniques to adapt semantic segmentation networks across the s...
The goal of video summarization is to select keyframes that are visually...
Traditional techniques for emotion recognition have focused on the facia...