With the growing interest in pretrained vision-language models like CLIP...
We propose a simple and application-friendly network (called SimpleNet) ...
Model adaptation aims at solving the domain transfer problem under the
c...
Zero-shot learning (ZSL) aims to recognize unseen classes by generalizin...
Video semantic segmentation aims to generate accurate semantic maps for ...
In unsupervised domain adaptive (UDA) semantic segmentation, the distill...
In this work, we consider the problem of cross-domain 3D action recognit...
Attributes skew hinders the current federated learning (FL) frameworks f...
In weakly-supervised temporal action localization (WS-TAL), the methods
...
Domain adaptation (DA) aims to transfer knowledge from a label-rich sour...
In this article, we introduce the solution we used in the VSPW 2021
Chal...
Recent feature contrastive learning (FCL) has shown promising performanc...
Few-shot learning is a challenging task since only few instances are giv...
Video semantic segmentation is active in recent years benefited from the...
Sparsity and varied density are two of the main obstacles for 3D detecti...
This paper focuses on the construction of stronger local features and th...
Recent research on super-resolution has achieved great success due to th...