With the rapid development of mobile devices, modern widely-used mobile
...
We propose MHR-Net, a novel method for recovering Non-Rigid Shapes from
...
This paper proposes a VR supermarket with an intelligent recommendation,...
Deep learning approaches have provided state-of-the-art performance in m...
Pruning techniques have been successfully used in neural networks to tra...
Point cloud sequences are irregular and unordered in the spatial dimensi...
Non-contrast computed tomography (NCCT) is commonly acquired for lung ca...
Moiré patterns, appearing as color distortions, severely degrade image a...
We address the problem of ground-to-satellite image geo-localization, th...
Neural networks tend to achieve better accuracy with training if they ar...
Most gait recognition methods exploit spatial-temporal representations f...
Many gait recognition methods first partition the human gait into N-part...
Efficiently quantifying renal structures can provide distinct spatial co...
Surrogate neural network-based models have lately been trained and used ...
Audio-driven one-shot talking face generation methods are usually traine...
The ability to estimate the 3D human shape and pose from images can be u...
Existing RGB-D saliency detection models do not explicitly encourage RGB...
3D Convolutional Neural Networks (CNNs) have been widely adopted for air...
We propose PR-RRN, a novel neural-network-based method for Non-rigid
Str...
In this paper, we study the task of hallucinating an authentic
high-reso...
We propose an audio-driven talking-head method to generate photo-realist...
Contrastive learning has shown superior performance in embedding global ...
In this paper, we investigate the task of hallucinating an authentic
hig...
Single-image super-resolution (SR) and multi-frame SR are two ways to su...
Object goal navigation aims to steer an agent towards a target object ba...
In this paper, we propose a novel text-based talking-head video generati...
Compared to 2D object bounding-box labeling, it is very difficult for hu...
We address the problem of novel view synthesis (NVS) from a few sparse s...
Conventional face super-resolution methods usually assume testing
low-re...
Video deblurring models exploit consecutive frames to remove blurs from
...
This paper presents a new approach for synthesizing a novel street-view
...
Existing image segmentation networks mainly leverage large-scale labeled...
Object pose estimation from a single RGB image is a challenging problem ...
Existing deep neural network based salient object detection (SOD) method...
Gait recognition is one of the most important biometric technologies and...
Sign language translation (SLT) aims to interpret sign video sequences i...
We propose a novel technique to register sparse 3D scans in the absence ...
Target-driven visual navigation aims at navigating an agent towards a gi...
The availability of a large labeled dataset is a key requirement for app...
We propose a closed-loop, multi-instance control algorithm for visually
...
Cross-view geo-localization is the problem of estimating the position an...
Compared with laborious pixel-wise dense labeling, it is much easier to ...
Word-level sign language recognition (WSLR) is a fundamental task in sig...
Existing face hallucination methods based on convolutional neural networ...
Estimating a 6DOF object pose from a single image is very challenging du...
Obtaining a high-quality frontal face image from a low-resolution (LR)
n...
Vision-based sign language recognition aims at helping the hearing-impai...
Tracing back the instruction execution sequence to debug a multicore sys...
This paper addresses the problem of cross-view image based localization,...
Popular 3D scan registration projects, such as Stanford digital Michelan...