We call on the Document AI (DocAI) community to reevaluate current
metho...
Document Visual Question Answering (DocVQA) refers to the task of answer...
Pretraining has proven successful in Document Intelligence tasks where d...
In this report we present results of the ICDAR 2021 edition of the Docum...
The open-ended question answering task of Text-VQA requires reading and
...
Current tasks and methods in Document Understanding aims to process docu...
Infographics are documents designed to effectively communicate informati...
This paper presents final results of ICDAR 2019 Scene Text Visual Questi...
Current visual question answering datasets do not consider the rich sema...
Images with visual and scene text content are ubiquitous in everyday lif...
We propose a framework for automated classification of Advertisement Ima...
In this work we introduce a cross modal image retrieval system that allo...
This paper proposes a region based convolutional neural network for
segm...
In this paper we propose an approach to lexicon-free recognition of text...
In this paper we propose a segmentation-free query by string word spotti...