In everyday conversations, humans can take on different roles and adapt ...
To generate proper captions for videos, the inference needs to identify
...
Providing explanations in the context of Visual Question Answering (VQA)...
Recently, an increasing number of works have introduced models capable o...
Transferring learned models to novel tasks is a challenging problem,
par...