We present a framework for generating appropriate facial responses from ...
We present a framework that formulates visual question answering as modu...
Training a referring expression comprehension (ReC) model for a new visu...
Understanding the relationship between figures and text is key to scient...
Answering questions that involve multi-step reasoning requires decomposi...
Neural module networks (NMNs) are a popular approach for modeling
compos...
Standard test sets for supervised learning evaluate in-distribution
gene...
Neural NLP models are increasingly accurate but are imperfect and
opaque...
Determining temporal relations (e.g., before or after) between events ha...
Several clustering frameworks with interactive (semi-supervised) queries...
In order for coreference resolution systems to be useful in practice, th...
People are often entities of interest in tasks such as search and inform...