Arjun Majumdar

research

∙ 07/20/2023

Behavioral Analysis of Vision-and-Language Navigation Agents

To be successful, Vision-and-Language Navigation (VLN) agents must be ab...

Zijiao Yang, et al.

research

∙ 05/04/2023

Masked Trajectory Models for Prediction, Representation, and Control

We introduce Masked Trajectory Models (MTM) as a generic abstraction for...

Philipp Wu, et al.

research

∙ 03/31/2023

Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?

We present the largest and most comprehensive empirical study of pre-tra...

Arjun Majumdar, et al.

research

∙ 03/14/2023

OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav

We present a single neural network architecture composed of task-agnosti...

Karmesh Yadav, et al.

research

∙ 06/24/2022

ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings

We present a scalable approach for learning open-world object-goal navig...

Arjun Majumdar, et al.

research

∙ 04/27/2022

Offline Visual Representation Learning for Embodied Navigation

How should we learn visual representations for embodied agents that must...

Karmesh Yadav, et al.

research

∙ 10/27/2021

SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation

Natural language instructions for visual navigation often use scene desc...

Abhinav Moudgil, et al.

research

∙ 11/07/2020

Sim-to-Real Transfer for Vision-and-Language Navigation

We study the challenging problem of releasing a robot in a previously un...

Peter Anderson, et al.

research

∙ 04/30/2020

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web

Following a navigation instruction such as 'Walk down the stairs and sto...

Arjun Majumdar, et al.

research

∙ 04/06/2020

Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments

We develop a language-guided navigation task set in a continuous 3D envi...

Jacob Krantz, et al.

research

∙ 03/14/2018

Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning

Visual question answering requires high-order reasoning about an image, ...

David Mascharka, et al.
