Researchers have extensively studied the field of vision and language,
d...
Text and signs around roads provide crucial information for drivers, vit...
Video Question Answering methods focus on commonsense reasoning and visu...
Recognition of text on word or line images, without the need for sub-wor...
In this report we present results of the ICDAR 2021 edition of the Docum...
This work addresses the problem of Question Answering (QA) on handwritte...
Infographics are documents designed to effectively communicate informati...
Inspired by the success of Deep Learning based approaches to English sce...
Images in the medical domain are fundamentally different from the genera...
This paper presents results of Document Visual Question Answering Challe...
We present a new dataset for Visual Question Answering on document image...
Perceiving text is crucial to understand semantics of outdoor scenes and...
This paper presents final results of ICDAR 2019 Scene Text Visual Questi...
Word error rate of an ocr is often higher than its character error rate....
Building robust recognizers for Arabic has always been challenging. We
d...