Despite the tremendous progress of Masked Autoencoders (MAE) in developi...
Lane detection is one of the fundamental modules in self-driving. In thi...
Micro-video recommender systems suffer from the ubiquitous noises in use...
Vision Transformer (ViT), as a powerful alternative to Convolutional Neu...
Class imbalance distribution widely exists in real-world engineering.
Ho...
Monocular 3D object detection is one of the most challenging tasks in 3D...
For face presentation attack detection (PAD), most of the spoofing cues ...
Sign language translation as a kind of technology with profound social
s...
There is a soaring interest in the news recommendation research scenario...
Lip reading, aiming to recognize spoken sentences according to the given...
Instance segmentation can detect where the objects are in an image, but ...
It is well known that adversarial attacks can fool deep neural networks ...
Recently, people tried to use a few anomalies for video anomaly detectio...
Image-only and pseudo-LiDAR representations are commonly used for monocu...
3D object detection algorithms for autonomous driving reason about 3D
ob...
As an instance-level recognition problem, re-identification (re-ID) requ...
Multi-modal representation learning by pretraining has become an increas...
Knowledge distillation aims at obtaining a small but effective deep mode...
Adversarial training is currently the most powerful defense against
adve...
With the rise of deep learning methods, person Re-Identification (ReID)
...
How to learn a stable model under agnostic distribution shift between
tr...
LIDAR point clouds and RGB-images are both extremely essential for 3D ob...
Model fine-tuning is a widely used transfer learning approach in person
...
The remarkable progress of network embedding has led to state-of-the-art...
Machine Comprehension (MC) is one of the core problems in natural langua...
Natural Language Inference (NLI), also known as Recognizing Textual
Enta...
Open-ended video question answering aims to automatically generate the
n...
Neural network compression empowers the effective yet unwieldy deep
conv...
Bidding optimization is one of the most critical problems in online
adve...
Dialogue Act Recognition (DAR) is a challenging problem in dialogue
inte...
In this paper, we consider the problem of machine reading task when the
...
Machine Comprehension (MC) is a challenging task in Natural Language
Pro...
Predicating macroscopic influences of drugs on human body, like efficacy...
Machine comprehension(MC) style question answering is a representative
p...
Sparse support vector machine (SVM) is a popular classification techniqu...
Learning a distance function or metric on a given data manifold is of gr...