Minesh Mathew

research

∙ 09/04/2023

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering

Researchers have extensively studied the field of vision and language, d...

0 Soumya Jahagirdar, et al. ∙

research

∙ 07/08/2023

Reading Between the Lanes: Text VideoQA on the Road

Text and signs around roads provide crucial information for drivers, vit...

0 George Tom, et al. ∙

research

∙ 11/10/2022

Watching the News: Towards VideoQA Models that can Read

Video Question Answering methods focus on commonsense reasoning and visu...

0 Soumya Jahagirdar, et al. ∙

research

∙ 05/13/2022

An empirical study of CTC based models for OCR of Indian languages

Recognition of text on word or line images, without the need for sub-wor...

13 Minesh Mathew, et al. ∙

research

∙ 11/10/2021

ICDAR 2021 Competition on Document VisualQuestion Answering

In this report we present results of the ICDAR 2021 edition of the Docum...

0 Ruben Tito, et al. ∙

research

∙ 10/02/2021

Asking questions on handwritten document collections

This work addresses the problem of Question Answering (QA) on handwritte...

0 Minesh Mathew, et al. ∙

research

∙ 04/26/2021

InfographicVQA

Infographics are documents designed to effectively communicate informati...

12 Minesh Mathew, et al. ∙

research

∙ 04/09/2021

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam

Inspired by the success of Deep Learning based approaches to English sce...

1 Minesh Mathew, et al. ∙

research

∙ 04/03/2021

MMBERT: Multimodal BERT Pretraining for Improved Medical VQA

Images in the medical domain are fundamentally different from the genera...

17 Yash Khare, et al. ∙

research

∙ 08/20/2020

Document Visual Question Answering Challenge 2020

This paper presents results of Document Visual Question Answering Challe...

2 Minesh Mathew, et al. ∙

research

∙ 07/01/2020

DocVQA: A Dataset for VQA on Document Images

We present a new dataset for Visual Question Answering on document image...

24 Minesh Mathew, et al. ∙

research

∙ 05/19/2020

RoadText-1K: Text Detection Recognition Dataset for Driving Videos

Perceiving text is crucial to understand semantics of outdoor scenes and...

8 Sangeeth Reddy, et al. ∙

research

∙ 06/30/2019

ICDAR 2019 Competition on Scene Text Visual Question Answering

This paper presents final results of ICDAR 2019 Scene Text Visual Questi...

0 Ali Furkan Biten, et al. ∙

research

∙ 05/28/2019

A Cost Efficient Approach to Correct OCR Errors in Large Document Collections

Word error rate of an ocr is often higher than its character error rate....

0 Deepayan Das, et al. ∙

research

∙ 11/07/2017

Unconstrained Scene Text and Video Text Recognition for Arabic Script

Building robust recognizers for Arabic has always been challenging. We d...

0 Mohit Jain, et al. ∙

Minesh Mathew

Featured Co-authors

Sign in with Google

Consider DeepAI Pro