Prithwijit Guha

research

∙ 02/28/2023

VQA with Cascade of Self- and Co-Attention Blocks

The use of complex attention modules has improved the performance of the...

0 Aakansha Mishra, et al. ∙

research

∙ 11/02/2020

Facial Keypoint Sequence Generation from Audio

Whenever we speak, our voice is accompanied by facial movements and expr...

0 Prateek Manocha, et al. ∙

research

∙ 07/08/2020

IQ-VQA: Intelligent Visual Question Answering

Even though there has been tremendous progress in the field of Visual Qu...

7 Vatsal Goel, et al. ∙

research

∙ 02/17/2020

CQ-VQA: Visual Question Answering on Categorized Questions

This paper proposes CQ-VQA, a novel 2-level hierarchical but end-to-end ...

6 Aakansha Mishra, et al. ∙

research

∙ 11/03/2018

Time-Frequency Audio Features for Speech-Music Classification

Distinct striation patterns are observed in the spectrograms of speech a...

0 Mrinmoy Bhattacharjee, et al. ∙

research

∙ 01/09/2017

Reinforcement Learning via Recurrent Convolutional Neural Networks

Deep Reinforcement Learning has enabled the learning of policies for com...

0 Tanmay Shankar, et al. ∙

research

∙ 04/02/2016

Overlay Text Extraction From TV News Broadcast

The text data present in overlaid bands convey brief descriptions of new...

0 Raghvendra Kannao, et al. ∙

research

∙ 07/05/2015

TV News Commercials Detection using Success based Locally Weighted Kernel Combination

Commercial detection in news broadcast videos involves judicious selecti...

0 Raghvendra Kannao, et al. ∙

research

∙ 01/25/2015

An Occlusion Reasoning Scheme for Monocular Pedestrian Tracking in Dynamic Scenes

This paper looks into the problem of pedestrian tracking using a monocul...

0 Sourav Garg, et al. ∙

Prithwijit Guha

Featured Co-authors

Sign in with Google

Consider DeepAI Pro