We present a framework that formulates visual question answering as modu...
Given the enormous number of instructional videos available online, lear...
YouTube users looking for instructions for a specific task may spend a l...
A generic video summary is an abridged version of a video that conveys t...
We introduce a non-parametric approach for infinite video texture synthe...
We introduce a learning-based approach for room navigation using semanti...
Accurately answering a question about a given image requires combining
o...
Question answering is an important task for autonomous agents and virtua...