We present Neural Space-filling Curves (SFCs), a data-driven approach to...
Video compression is a central feature of the modern internet powering
t...
Synthesizing images of a person in novel poses from a single image is a
...
Attention mechanisms have been widely applied to cross-modal tasks such ...
Urban material recognition in remote sensing imagery is a highly relevan...
Label distributions in real-world are oftentimes long-tailed and imbalan...
We present a novel architecture for 3D object detection, M3DeTR, which
c...
The standard way of training video models entails sampling at each itera...
Deep learning relies on the availability of a large corpus of data (labe...
A neural network regularizer (e.g., weight decay) boosts performance by
...
Over the past six years, deep generative models have achieved a qualitat...
Recognition tasks, such as object recognition and keypoint estimation, h...
Most existing distance metric learning approaches use fully labeled data...
With the proliferation of deep learning methods, many computer vision
pr...
One central question for video action recognition is how to model motion...
The widely adopted sequential variant of Non Maximum Suppression (or
Gre...
Retrieval networks are essential for searching and indexing. Compared to...
We address the problem of layout generation for diverse domains such as
...
The JPEG image compression algorithm is the most popular method of image...
Generative Adversarial Networks (GANs) have brought about rapid progress...
We present a systematic study of adversarial attacks on state-of-the-art...
We address the problem of distance metric learning in visual similarity
...
In this paper, we propose Spatio-TEmporal Progressive (STEP) action
dete...
We introduce an unsupervised formulation to estimate heteroscedastic
unc...
We employ triplet loss as a space embedding regularizer to boost
classif...
We cast visual retrieval as a regression problem by posing triplet loss ...
We introduce a general method of performing Residual Network inference a...
We propose novel Stacked Spatio-Temporal Graph Convolutional Networks
(S...
Research in computer graphics has been in pursuit of realistic image
gen...
Recent advances in Generative Adversarial Networks (GANs) have shown
inc...
We present a self-supervised approach using spatio-temporal signals betw...
Automatic colorization is the process of adding color to greyscale image...
Motion boundary detection is a crucial yet challenging problem. Prior me...
In recent years, it is common practice to extract fully-connected layer ...
In this paper, we introduce the Face Magnifier Network (Face-MageNet), a...
Compared to the general semantic segmentation problem, portrait segmenta...
We introduce the Single Stage Headless (SSH) face detector. Unlike two s...
Visual narrative is often a combination of explicit information and judi...
Typical textual descriptions that accompany online videos are 'weak': i....
Deep Convolutional Neural Networks (CNN) enforces supervised information...
This paper presents a structured ordinal measure method for video-based ...