Chenjun Xiao

research

∙ 06/09/2023

In-Sample Policy Iteration for Offline Reinforcement Learning

Offline reinforcement learning (RL) seeks to derive an effective control...

0 Xiaohan Hu, et al. ∙

research

∙ 03/16/2023

Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning

Efficient exploration is critical in cooperative deep Multi-Agent Reinfo...

0 Xutong Zhao, et al. ∙

research

∙ 02/28/2023

The In-Sample Softmax for Offline Reinforcement Learning

Reinforcement learning (RL) agents can leverage batches of previously co...

0 Chenjun Xiao, et al. ∙

research

∙ 12/17/2022

Latent Variable Representation for Reinforcement Learning

Deep latent variable models have achieved significant empirical successe...

10 Tongzheng Ren, et al. ∙

research

∙ 10/29/2021

Understanding the Effect of Stochasticity in Policy Optimization

We study the effect of stochasticity in on-policy policy optimization, a...

0 Jincheng Mei, et al. ∙

research

∙ 06/18/2021

On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data

We study the fundamental question of the sample complexity of learning a...

0 Chenjun Xiao, et al. ∙

research

∙ 04/06/2021

On the Optimality of Batch Policy Optimization Algorithms

Batch policy optimization considers leveraging existing data for policy ...

0 Chenjun Xiao, et al. ∙

research

∙ 05/13/2020

On the Global Convergence Rates of Softmax Policy Gradient Methods

We make three contributions toward better understanding policy gradient ...

4 Jincheng Mei, et al. ∙

research

∙ 12/24/2019

Learning to Combat Compounding-Error in Model-Based Reinforcement Learning

Despite its potential to improve sample complexity versus model-free app...

0 Chenjun Xiao, et al. ∙

Chenjun Xiao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro