Large-scale object detection and instance segmentation face a severe dat...
Detection Transformer (DETR) directly transforms queries to unique objec...
We present cross-view transformers, an efficient attention-based model f...
In this paper, we present a system to train driving policies from experi...
Lidar-based sensing drives current autonomous vehicles. Despite rapid
pr...
Our world offers a never-ending stream of visual stimuli, yet today's vi...
We learn an interactive vision-based driving policy from pre-recorded dr...
We develop a probabilistic interpretation of two-stage object detection....
How do we build a general and broad object detection system? We use all
...
Deep learning is slowly, but steadily, hitting a memory bottleneck. Whil...
Deep networks devour millions of precisely annotated images to build the...
Three-dimensional objects are commonly represented as 3D boxes in a
poin...
We introduce a simple and efficient lossless image compression algorithm...
Tracking has traditionally been the art of following interest points thr...
Vision-based urban driving is hard. The autonomous system needs to learn...
Training competitive deep video models is an order of magnitude slower t...
Computer vision produces representations of scene content. Much computer...
Convolutions on monocular dash cam videos capture spatial invariances in...
Detection identifies objects as axis-aligned boxes in an image. Most
suc...
With the advent of deep learning, object detection drifted from a bottom...
To understand the world, we humans constantly need to relate the present...
3D vehicle detection and tracking from a monocular camera requires detec...
Deep reinforcement learning (RL) has achieved breakthrough results on ma...
An ever increasing amount of our digital communication, media consumptio...
Training robust deep video representations has proven to be much more
ch...
Deep embeddings answer one simple question: How similar are two images?
...
Realistic image manipulation is challenging because it requires modifyin...
The ability of the Generative Adversarial Networks (GANs) framework to l...
We present an unsupervised visual feature learning algorithm driven by
c...
Discriminative deep learning approaches have shown impressive results fo...
We propose a data-driven approach for intrinsic image decomposition, whi...
We present an approach to learn a dense pixel-wise labeling from image-l...
Most state-of-the-art techniques for multi-class image segmentation and
...