Answering visual queries is a complex task that requires both visual
pro...
Synthesizing 3D human avatars interacting realistically with a scene is ...
We introduce a representation learning framework for spatial trajectorie...
We present an approach for recommending a music track for a given video,...
For computer vision systems to operate in dynamic situations, they need ...
We introduce a framework for learning from unlabeled video what is
predi...
Multi-language machine translation without parallel corpora is challengi...
When we travel, we often encounter new scenarios we have never experienc...
In this paper, we explore neural network models that learn to associate
...
The increasing amount of online videos brings several opportunities for
...
Catastrophic forgetting occurs when a neural network loses the informati...