The use of complex attention modules has improved the performance of the...
Whenever we speak, our voice is accompanied by facial movements and
expr...
Even though there has been tremendous progress in the field of Visual
Qu...
This paper proposes CQ-VQA, a novel 2-level hierarchical but end-to-end ...
Distinct striation patterns are observed in the spectrograms of speech a...
Deep Reinforcement Learning has enabled the learning of policies for com...
The text data present in overlaid bands convey brief descriptions of new...
Commercial detection in news broadcast videos involves judicious selecti...
This paper looks into the problem of pedestrian tracking using a monocul...