Asim Kadav

research

∙ 05/16/2023

Learning Higher-order Object Interactions for Keypoint-based Video Understanding

Action recognition is an important problem that requires identifying act...

Yi Huang, et al.

research

∙ 01/20/2022

Self-supervised Video Representation Learning with Cascade Positive Retrieval

Self-supervised video representation learning has been shown to effectiv...

Cheng-En Wu, et al.

research

∙ 12/31/2021

SplitBrain: Hybrid Data and Model Parallel Deep Learning

The recent success of deep learning applications has coincided with thos...

Farley Lai, et al.

research

∙ 12/11/2021

COMPOSER: Compositional Learning of Group Activity in Videos

Group Activity Recognition (GAR) detects the activity performed by a gro...

Honglu Zhou, et al.

research

∙ 08/20/2021

Dual Projection Generative Adversarial Networks for Conditional Image Generation

Conditional Generative Adversarial Networks (cGANs) extend the standard ...

Ligong Han, et al.

research

∙ 03/19/2021

Hopper: Multi-hop Transformer for Spatiotemporal Reasoning

This paper considers the problem of spatiotemporal object-centric reason...

Honglu Zhou, et al.

research

∙ 05/23/2020

S3VAE: Self-Supervised Sequential VAE for Representation Disentanglement and Data Generation

We propose a sequential variational autoencoder to learn disentangled re...

Yizhe Zhu, et al.

research

∙ 12/05/2019

15 Keypoints Is All You Need

Pose tracking is an important problem that requires identifying unique h...

Michael Snower, et al.

research

∙ 11/05/2019

Contextual Grounding of Natural Language Entities in Images

In this paper, we introduce a contextual grounding approach that capture...

Farley Lai, et al.

research

∙ 04/22/2019

Tripping through time: Efficient Localization of Activities in Videos

Localizing moments in untrimmed videos via language queries is a new and...

Meera Hahn, et al.

research

∙ 01/20/2019

Visual Entailment: A Novel Task for Fine-Grained Image Understanding

Existing visual reasoning datasets such as Visual Question Answering (VQ...

Ning Xie, et al.

research

∙ 11/26/2018

Visual Entailment Task for Visually-Grounded Language Learning

We introduce a new inference task - Visual Entailment (VE) - which diffe...

Ning Xie, et al.

research

∙ 10/25/2018

Teaching Syntax by Adversarial Distraction

Existing entailment datasets mainly pose problems which can be answered ...

Juho Kim, et al.

research

∙ 02/01/2018

Adaptive Memory Networks

We present Adaptive Memory Networks (AMN) that processes input-question ...

Daniel Li, et al.

research

∙ 12/11/2017

DeepConfig: Automating Data Center Network Topologies Management with Machine Learning

In recent years, many techniques have been developed to improve the perf...

Christopher Streiffer, et al.

research

∙ 11/16/2017

Grounded Objects and Interactions for Video Captioning

We address the problem of video captioning by grounding language generat...

Chih-Yao Ma, et al.

research

∙ 11/16/2017

Attend and Interact: Higher-Order Object Interactions for Video Understanding

Human actions often involve complex interactions across several inter-re...

Chih-Yao Ma, et al.

research

∙ 12/22/2016

A Context-aware Attention Network for Interactive Question Answering

Neural network based sequence-to-sequence models in an encoder-decoder f...

Huayu Li, et al.

research

∙ 08/31/2016

Pruning Filters for Efficient ConvNets

The success of CNNs in various applications is accompanied by a signific...

Hao Li, et al.

