Point-, voxel-, and range-views are three representative forms of point
...
With the growing popularity of digital twin and autonomous driving in
tr...
With the commercial application of automated vehicles (AVs), the sharing...
Existing offboard 3D detectors always follow a modular pipeline design t...
We present a novel multi-view implicit surface reconstruction technique,...
The research on extrinsic calibration between Light Detection and
Rangin...
It is a long-term vision for Autonomous Driving (AD) community that the
...
With the development of autonomous driving technology, sensor calibratio...
Although Domain Generalization (DG) problem has been fast-growing in the...
The popular VQ-VAE models reconstruct images through learning a discrete...
We propose a perception imitation method to simulate results of a certai...
Video summarization aims to distill the most important information from ...
Training deep models for semantic scene completion (SSC) is challenging ...
Current 3D object detection models follow a single dataset-specific trai...
Unsupervised Domain Adaptation (UDA) technique has been explored in 3D
c...
LiDAR segmentation is crucial for autonomous driving perception. Recent
...
Due to the complex and changing interactions in dynamic scenarios, motio...
LiDAR-camera fusion methods have shown impressive performance in 3D obje...
With the development of autonomous driving, it is becoming increasingly
...
The performance of sensors in the autonomous driving system is fundament...
Contrastive language-image pre-training (CLIP) achieves promising result...
Recently, Vehicle-to-Everything(V2X) cooperative perception has attracte...
Multi-modal 3D object detection has been an active research topic in
aut...
Accurate and reliable sensor calibration is critical for fusing LiDAR an...
In computer vision, multi-label classification, including zero-shot
mult...
This article addresses the problem of distilling knowledge from a large
...
Accurate sensor calibration is a prerequisite for multi-sensor perceptio...
Point cloud completion is a generation and estimation issue derived from...
Sensor-based environmental perception is a crucial part of the autonomou...
Neural Architecture Search (NAS) has attracted increasingly more attenti...
Sensor-based environmental perception is a crucial step for autonomous
d...
Generating images with conditional descriptions gains increasing interes...
Sensor configuration, including the sensor selections and their installa...
For autonomous vehicles, an accurate calibration for LiDAR and camera is...
Conditional text generation has been a challenging task that is yet to s...
By assigning each relationship a single label, current approaches formul...
Recognizing Video events in long, complex videos with multiple sub-activ...
Despite some exciting progress on high-quality image generation from
str...
Hand pose estimation from the monocular 2D image is challenging due to t...
This paper considers a realistic problem in person re-identification (re...
Human visual recognition of activities or external agents involves an
in...
The research on hashing techniques for visual data is gaining increased
...
In this paper, we propose a novel Question-Guided Hybrid Convolution (QG...
Generating scene graph to describe all the relations inside an image gai...
In this paper, we introduce "Power Linear Unit" (PoLU) which increases t...
Recent advances in visual activity recognition have raised the possibili...
Image completion has achieved significant progress due to advances in
ge...
Recently visual question answering (VQA) and visual question generation ...
Object detection, scene graph generation and region captioning, which ar...
As the intermediate level task connecting image captioning and object
de...