Seeing is believing, however, the underlying mechanism of how human visu...
Vector graphics (VG) have been ubiquitous in our daily life with vast
ap...
As an important task in multimodal context understanding, Text-VQA (Visu...
Humans convey their intentions through the usage of both verbal and nonv...
Story ending generation is a strong indication of story comprehension. T...
Asking good questions in large-scale, open-domain conversational systems...