Point-, voxel-, and range-views are three representative forms of point
...
Human-centric scene understanding is significant for real-world applicat...
Zero-shot point cloud segmentation aims to make deep models capable of
r...
Recent advancements in vision foundation models (VFMs) have opened up ne...
Vision foundation models such as Contrastive Vision-Language Pre-trainin...
The robustness of 3D perception systems under natural corruptions from
e...
Recently, methods for neural surface representation and rendering, for
e...
LiDAR segmentation is crucial for autonomous driving perception. Recent
...
Contrastive language-image pre-training (CLIP) achieves promising result...
We investigate transductive zero-shot point cloud semantic segmentation ...
Promising performance has been achieved for visual perception on the poi...
Personalized video highlight detection aims to shorten a long video to
i...
We study the problem of weakly supervised grounded image captioning. Tha...
Detecting 3D landmarks on cone-beam computed tomography (CBCT) is crucia...
Well-annotated medical images are costly and sometimes even impossible t...
Learning structures of 3D shapes is a fundamental problem in the field o...
Marking anatomical landmarks in cephalometric radiography is a critical
...
In this paper, we propose a general framework for image classification u...