Adriana Kovashka

research

∙ 06/11/2023

Impact of Experiencing Misrecognition by Teachable Agents on Learning and Rapport

While speech-enabled teachable agents have some advantages over typing-b...

0 Yuya Asano, et al. ∙

research

∙ 04/25/2023

Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining

Named entities are ubiquitous in text that naturally accompanies images,...

0 Giacomo Nebbia, et al. ∙

research

∙ 03/20/2023

Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth

Despite recent attention and exploration of depth for various tasks, it ...

0 Cagri Gungor, et al. ∙

research

∙ 03/17/2023

Enhancing the Role of Context in Region-Word Alignment for Object Detection

Vision-language pretraining to learn a fine-grained, region-word alignme...

0 Kyle Buettner, et al. ∙

research

∙ 03/16/2023

VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

The use of large-scale vision-language datasets is limited for object de...

0 Arushi Rai, et al. ∙

research

∙ 03/09/2023

Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors

Human-object interaction (HOI) detection aims to extract interacting hum...

0 Mesut Erhan Unal, et al. ∙

research

∙ 12/09/2022

Contrastive View Design Strategies to Enhance Robustness to Domain Shifts in Downstream Object Detection

Contrastive learning has emerged as a competitive pretraining method for...

0 Kyle Buettner, et al. ∙

research

∙ 09/23/2022

Comparison of Lexical Alignment with a Teachable Robot in Human-Robot and Human-Human-Robot Interactions

Speakers build rapport in the process of aligning conversational behavio...

0 Yuya Asano, et al. ∙

research

∙ 06/10/2022

Symbolic image detection using scene and knowledge graphs

Sometimes the meaning conveyed by images goes beyond the list of objects...

0 Nasrin Kalanat, et al. ∙

research

∙ 05/12/2022

Weakly-Supervised Action Detection Guided by Audio Narration

Videos are more well-organized curated data sources for visual concept l...

0 Keren Ye, et al. ∙

research

∙ 09/20/2021

Characterizing User Susceptibility to COVID-19 Misinformation on Twitter

Though significant efforts such as removing false claims and promoting r...

0 Xian Teng, et al. ∙

research

∙ 06/24/2021

Exploring Corruption Robustness: Inductive Biases in Vision Transformers and MLP-Mixers

Recently, vision transformers and MLP-based models have been developed i...

0 Katelyn Morrison, et al. ∙

research

∙ 05/28/2021

Linguistic Structures as Weak Supervision for Visual Scene Graph Generation

Prior work in scene graph generation requires categorical supervision at...

0 Keren Ye, et al. ∙

research

∙ 05/07/2021

BasisNet: Two-stage Model Synthesis for Efficient Inference

In this work, we present BasisNet which combines recent advancements in ...

14 Mingda Zhang, et al. ∙

research

∙ 03/29/2021

Domain-robust VQA with diverse datasets and methods but no target labels

The observation that computer vision methods overfit to dataset specific...

0 Mingda Zhang, et al. ∙

research

∙ 01/04/2021

SpotPatch: Parameter-Efficient Transfer Learning for Mobile Object Detection

Deep learning based object detectors are commonly deployed on mobile dev...

1 Keren Ye, et al. ∙

research

∙ 12/03/2020

Learning to Transfer Visual Effects from Videos to Images

We study the problem of animating images by transferring spatio-temporal...

0 Christopher Thomas, et al. ∙

research

∙ 07/16/2020

Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval

The abundance of multimodal data (e.g. social media posts) has inspired ...

0 Christopher Thomas, et al. ∙

research

∙ 10/31/2019

Predicting the Politics of an Image Using Webly Supervised Data

The news media shape public opinion, and often, the visual bias they con...

19 Christopher Thomas, et al. ∙

research

∙ 07/23/2019

Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection

Learning to localize and name object instances is a fundamental problem ...

4 Keren Ye, et al. ∙

research

∙ 01/15/2019

Measuring Effectiveness of Video Advertisements

Advertisements are unavoidable in modern society. Times Square is notori...

4 James Hahn, et al. ∙

research

∙ 12/28/2018

Artistic Object Recognition by Unsupervised Style Adaptation

Computer vision systems currently lack the ability to reliably recognize...

0 Christopher Thomas, et al. ∙

research

∙ 11/25/2018

Learning to discover and localize visual objects with open vocabulary

To alleviate the cost of obtaining accurate bounding boxes for training ...

1 Keren Ye, et al. ∙

research

∙ 07/29/2018

Story Understanding in Video Advertisements

In order to resonate with the viewers, many video advertisements explore...

7 Keren Ye, et al. ∙

research

∙ 07/25/2018

Persuasive Faces: Generating Faces in Advertisements

In this paper, we examine the visual variability of objects across diffe...

2 Christopher Thomas, et al. ∙

research

∙ 07/21/2018

Equal But Not The Same: Understanding the Implicit Relationship Between Persuasive Images and Text

Images and text in advertisements interact in complex, non-literal ways....

0 Mingda Zhang, et al. ∙

research

∙ 05/08/2018

Image Retrieval with Mixed Initiative and Multimodal Feedback

How would you search for a unique, fashionable shoe that a friend wore a...

0 Nils Murrugarra-Llerena, et al. ∙

research

∙ 11/17/2017

ADVISE: Symbolism and External Knowledge for Decoding Advertisements

In order to convey the most content in their limited space, advertisemen...

1 Keren Ye, et al. ∙

research

∙ 07/10/2017

Automatic Understanding of Image and Video Advertisements

There is more to images than their objective physical content: for examp...

1 Zaeem Hussain, et al. ∙

research

∙ 11/07/2016

Crowdsourcing in Computer Vision

Computer vision systems require large amounts of manually annotated data...

0 Adriana Kovashka, et al. ∙

Adriana Kovashka

Featured Co-authors

Sign in with Google

Consider DeepAI Pro