Haonan Yu

research

∙ 02/02/2023

Policy Expansion for Bridging Offline-to-Online Reinforcement Learning

Pre-training with offline data and online fine-tuning using reinforcemen...

0 Haichao Zhang, et al. ∙

research

∙ 01/28/2022

Do You Need the Entropy Reward (in Practice)?

Maximum entropy (MaxEnt) RL maximizes a combination of the original task...

0 Haonan Yu, et al. ∙

research

∙ 01/28/2022

Towards Safe Reinforcement Learning with a Safety Editor Policy

We consider the safe reinforcement learning (RL) problem of maximizing u...

0 Haonan Yu, et al. ∙

research

∙ 01/24/2022

Generative Planning for Temporally Coordinated Exploration in Reinforcement Learning

Standard model-free reinforcement learning algorithms optimize a policy ...

0 Haichao Zhang, et al. ∙

research

∙ 04/13/2021

TASAC: Temporally Abstract Soft Actor-Critic for Continuous Control

We propose temporally abstract soft actor-critic (TASAC), an off-policy ...

0 Haonan Yu, et al. ∙

research

∙ 03/07/2021

MetaView: Few-shot Active Object Recognition

In robot sensing scenarios, instead of passively utilizing human capture...

0 Wei Wei, et al. ∙

research

∙ 01/16/2021

Hierarchical Reinforcement Learning By Discovering Intrinsic Options

We propose a hierarchical reinforcement learning method, HIDIO, that can...

0 Jesse Zhang, et al. ∙

research

∙ 07/22/2019

Why Build an Assistant in Minecraft?

In this document we describe a rationale for a research program aimed at...

2 Arthur Szlam, et al. ∙

research

∙ 07/19/2019

CraftAssist: A Framework for Dialogue-enabled Interactive Agents

This paper describes an implementation of a bot assistant in Minecraft, ...

0 Jonathan Gray, et al. ∙

research

∙ 07/07/2019

EPNAS: Efficient Progressive Neural Architecture Search

In this paper, we propose Efficient Progressive Neural Architecture Sear...

0 Yanqi Zhou, et al. ∙

research

∙ 06/06/2019

One ticket to win them all: generalizing lottery ticket initializations across datasets and optimizers

The success of lottery ticket initializations (Frankle and Carbin, 2019)...

5 Ari S. Morcos, et al. ∙

research

∙ 06/06/2019

Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP

The lottery ticket hypothesis proposes that over-parameterization of dee...

0 Haonan Yu, et al. ∙

research

∙ 06/12/2018

Resource-Efficient Neural Architect

Neural Architecture Search (NAS) is a laborious process. Prior work on a...

0 Yanqi Zhou, et al. ∙

research

∙ 05/22/2018

Guided Feature Transformation (GFT): A Neural Language Grounding Module for Embodied Agents

Recently there has been a rising interest in training agents, embodied i...

0 Haonan Yu, et al. ∙

research

∙ 04/26/2018

Interactive Language Acquisition with One-shot Visual Concept Learning through a Conversational Game

Building intelligent agents that can communicate with and learn from hum...

0 Haichao Zhang, et al. ∙

research

∙ 01/31/2018

Interactive Grounded Language Acquisition and Generalization in a 2D World

We build a virtual agent for learning language in a 2D maze-like world. ...

0 Haonan Yu, et al. ∙

research

∙ 05/28/2017

Listen, Interact and Talk: Learning to Speak via Interaction

One of the long-term goals of artificial intelligence is to build an age...

0 Haichao Zhang, et al. ∙

research

∙ 03/28/2017

A Deep Compositional Framework for Human-like Language Acquisition in Virtual Environment

We tackle a task where an agent learns to navigate in a 2D maze-like env...

0 Haonan Yu, et al. ∙

research

∙ 08/25/2015

Robot Language Learning, Generation, and Comprehension

We present a unified framework which supports grounding natural-language...

0 Daniel Paul Barrett, et al. ∙

research

∙ 06/05/2015

Sentence Directed Video Object Codetection

We tackle the problem of video object codetection by leveraging the weak...

0 Haonan Yu, et al. ∙

research

∙ 11/14/2014

A Faster Method for Tracking and Scoring Videos Corresponding to Sentences

Prior work presented the sentence tracker, a method for scoring how well...

0 Haonan Yu, et al. ∙

research

∙ 06/21/2013

Discriminative Training: Learning to Describe Video with Sentences, from Video Described with Sentences

We present a method for learning word meanings from complex and realisti...

0 Haonan Yu, et al. ∙
