The field of generative AI has a transformative impact on various areas,...
This paper shows that Masking the Deep hierarchical features is an effic...
Recently, perception tasks based on Bird's-Eye View (BEV) representation ...
Recently, the pure camera-based Bird's-Eye-View (BEV) perception removes...
Human-Object Interaction (HOI) detection aims to learn how human interac...
Document-level natural language inference (DOCNLI) is a new challenging ...
Unsupervised reinforcement learning aims at learning a generalist policy...
Bayesian Optimization (BO) is a common solution to search for optimal hyperp...
Existing Binary Neural Networks (BNNs) mainly operate on local convoluti...
With the rapid development of automatic fake news detection technology, ...
Mainstream object detectors are commonly constituted of two sub-tasks, i...
Distracted driving causes thousands of deaths per year, and how to apply...
Self-supervised Masked Autoencoders (MAE) are emerging as a new pre-trai...
Contrastive Language-Image Pretraining (CLIP) has emerged as a novel par...
Sequence ordering of word vectors matters a lot to text reading, which ha...
Recently, self-supervised vision transformers have attracted unprecedent...
Unsupervised sentence embedding aims to obtain the most appropriate embe...
Recently, large-scale Contrastive Language-Image Pre-training (CLIP) has...