Ernest Valveny

research

∙ 05/15/2023

Document Understanding Dataset and Evaluation (DUDE)

We call on the Document AI (DocAI) community to reevaluate current metho...

0 Jordy Landeghem, et al. ∙

research

∙ 12/07/2022

Hierarchical multimodal transformers for Multi-Page DocVQA

Document Visual Question Answering (DocVQA) refers to the task of answer...

0 Ruben Tito, et al. ∙

research

∙ 02/25/2022

OCR-IDL: OCR Annotations for Industry Document Library Dataset

Pretraining has proven successful in Document Intelligence tasks where d...

0 Ali Furkan Biten, et al. ∙

research

∙ 11/10/2021

ICDAR 2021 Competition on Document VisualQuestion Answering

In this report we present results of the ICDAR 2021 edition of the Docum...

0 Ruben Tito, et al. ∙

research

∙ 08/22/2021

External Knowledge enabled Text Visual Question Answering

The open-ended question answering task of Text-VQA requires reading and ...

4 Arka Ujjal Dey, et al. ∙

research

∙ 04/27/2021

Document Collection Visual Question Answering

Current tasks and methods in Document Understanding aims to process docu...

0 Ruben Tito, et al. ∙

research

∙ 04/26/2021

InfographicVQA

Infographics are documents designed to effectively communicate informati...

12 Minesh Mathew, et al. ∙

research

∙ 06/30/2019

ICDAR 2019 Competition on Scene Text Visual Question Answering

This paper presents final results of ICDAR 2019 Scene Text Visual Questi...

0 Ali Furkan Biten, et al. ∙

research

∙ 05/31/2019

Scene Text Visual Question Answering

Current visual question answering datasets do not consider the rich sema...

0 Ali Furkan Biten, et al. ∙

research

∙ 05/25/2019

Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding

Images with visual and scene text content are ubiquitous in everyday lif...

0 Arka Ujjal Dey, et al. ∙

research

∙ 06/21/2018

Don't only Feel Read: Using Scene text to understand advertisements

We propose a framework for automated classification of Advertisement Ima...

0 Arka Ujjwal dey, et al. ∙

research

∙ 04/28/2018

Learning Cross-Modal Deep Embeddings for Multi-Object Image Retrieval using Text and Sketch

In this work we introduce a cross modal image retrieval system that allo...

0 Sounak Dey, et al. ∙

research

∙ 07/05/2017

R-PHOC: Segmentation-Free Word Spotting using CNN

This paper proposes a region based convolutional neural network for segm...

0 Suman Ghosh, et al. ∙

research

∙ 06/05/2017

Visual attention models for scene text recognition

In this paper we propose an approach to lexicon-free recognition of text...

0 Suman K. Ghosh, et al. ∙

research

∙ 05/28/2015

Query by String word spotting based on character bi-gram indexing

In this paper we propose a segmentation-free query by string word spotti...

0 Suman K. Ghosh, et al. ∙

Ernest Valveny

Featured Co-authors

Sign in with Google

Consider DeepAI Pro