Alexei A. Efros
Professor
Given a set of calibrated images of a scene, we present an approach that...
Do different neural networks, trained for various vision tasks, share so...
While large text-to-image models are able to synthesize "novel" images, ...
Large-scale generative models are capable of producing high-quality imag...
Dataset Distillation aims to distill an entire dataset's knowledge into ...
We propose a method for editing NeRF scenes with text-instructions. Give...
Modern vision models typically rely on fine-tuning general-purpose model...
We propose a method for editing images from human instructions: given an...
Contrastive methods have led a recent surge in the performance of
self-s...
We explore a data-driven approach for learning to optimize neural networ...
Test-time training adapts to a new test distribution on the fly by optim...
In this work, we study how the performance and evaluation of generative ...
How does one adapt a pre-trained visual model to novel downstream tasks
...
We present a video generation model that accurately reproduces object mo...
We propose an unsupervised, mid-level representation for a generative mo...
Approaches for single-view reconstruction typically rely on viewpoint
an...
Dataset distillation is the task of synthesizing a small dataset such th...
A range of video modeling tasks, from optical flow to multiple object
tr...
What does human pose tell us about a scene? We propose a task to answer ...
The goal of this work is to efficiently identify visually similar patter...
Visual content often contains recurring elements. Text is made up of gly...
Training generative models, such as GANs, on a target domain containing
...
We introduce a non-parametric approach for infinite video texture synthe...
Recent self-supervised contrastive methods have been able to produce
imp...
We propose a learning-based framework for disentangling outdoor scenes i...
In image-to-image translation, each patch in the output should reflect t...
Due to the ubiquity of smartphones, it is popular to take photos of one'...
In most real world scenarios, a policy trained by reinforcement learning...
Deep generative models have become increasingly effective at producing
r...
This paper proposes a simple self-supervised approach for learning
repre...
This paper considers the generic problem of dense alignment between two
...
In this work we ask whether it is possible to create a "universal" detec...
We introduce a general approach, called test-time training, for improvin...
This paper addresses unsupervised domain adaptation, the setting where
l...
Arnab Ghosh 6:32 PM We propose an interactive GAN-based sketch-to-image
...
Most malicious photo manipulations are created using standard image edit...
We introduce a self-supervised method for learning visual correspondence...
Our goal in this paper is to discover near duplicate patterns in large
c...
Contemporary sensorimotor learning approaches typically start with an
ex...
Model distillation aims to distill the knowledge of a complex model into...
Prediction is arguably one of the most basic functions of an intelligent...
This paper presents a simple method for "do as I do" motion transfer: gi...
Current major approaches to visual recognition follow an end-to-end
form...
Reinforcement learning algorithms rely on carefully engineering environm...
Advances in photo editing and manipulation tools have made it significan...
The current dominant paradigm for imitation learning relies on strong
su...
The thud of a bouncing ball, the onset of speech as lips open -- when vi...
Computer vision has advanced significantly that many discriminative
appr...
We present a learning framework for recovering the 3D shape, camera, and...
What makes humans so good at solving seemingly complex video games? Unli...