Anjana Arunkumar

research

∙ 07/20/2023

Image or Information? Examining the Nature and Impact of Visualization Perceptual Classification

How do people internalize visualizations: as images or information? In t...

0 Anjana Arunkumar, et al. ∙

research

∙ 04/12/2023

LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity

Cross-task generalization is a significant outcome that defines mastery ...

0 Anjana Arunkumar, et al. ∙

research

∙ 04/04/2023

PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models

Large Language Models (LLMs) have gained widespread popularity due to th...

0 Aditi Mishra, et al. ∙

research

∙ 02/09/2023

Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow

Recent research has shown that language models exploit `artifacts' in be...

2 Anjana Arunkumar, et al. ∙

research

∙ 10/14/2022

Hardness of Samples Need to be Quantified for a Reliable Evaluation System: Exploring Potential Opportunities with a New Task

Evaluation of models on benchmarks is unreliable without knowing the deg...

1 Swaroop Mishra, et al. ∙

research

∙ 10/14/2022

A Survey of Parameters Associated with the Quality of Benchmarks in NLP

Several benchmarks have been built with heavy investment in resources to...

12 Swaroop Mishra, et al. ∙

research

∙ 10/10/2022

Investigating the Failure Modes of the AUC metric and Exploring Alternatives for Evaluating Systems in Safety Critical Applications

With the increasing importance of safety requirements associated with th...

9 Swaroop Mishra, et al. ∙

research

∙ 09/08/2022

PMU Tracker: A Visualization Platform for Epicentric Event Propagation Analysis in the Power Grid

The electrical power grid is a critical infrastructure, with disruptions...

0 Anjana Arunkumar, et al. ∙

research

∙ 03/12/2022

A Proposal to Study "Is High Quality Data All We Need?"

Even though deep neural models have achieved superhuman performance on m...

10 Swaroop Mishra, et al. ∙

research

∙ 08/13/2021

Bayesian Modelling of Alluvial Diagram Complexity

Alluvial diagrams are a popular technique for visualizing flow and relat...

0 Anjana Arunkumar, et al. ∙

research

∙ 06/10/2021

Front Contribution instead of Back Propagation

Deep Learning's outstanding track record across several domains has stem...

48 Swaroop Mishra, et al. ∙

research

∙ 06/10/2021

How Robust are Model Rankings: A Leaderboard Customization Approach for Equitable Evaluation

Models that top leaderboards often perform unsatisfactorily when deploye...

8 Swaroop Mishra, et al. ∙

research

∙ 08/10/2020

DQI: A Guide to Benchmark Evaluation

A `state of the art' model A surpasses humans in a benchmark B, but fail...

11 Swaroop Mishra, et al. ∙

research

∙ 07/14/2020

Our Evaluation Metric Needs an Update to Encourage Generalization

Models that surpass human performance on several popular benchmarks disp...

6 Swaroop Mishra, et al. ∙

research

∙ 05/02/2020

DQI: Measuring Data Quality in NLP

Neural language models have achieved human level performance across seve...

0 Swaroop Mishra, et al. ∙

Anjana Arunkumar

Featured Co-authors

Sign in with Google

Consider DeepAI Pro