Wenhao Yang

research

∙ 05/22/2023

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Mirror descent value iteration (MDVI), an abstraction of Kullback-Leible...

0 Toshinori Kitamura, et al. ∙

research

∙ 05/19/2023

Non-stationary Projection-free Online Learning with Dynamic and Adaptive Regret Guarantees

Projection-free online learning has drawn increasing interest due to its...

0 Yibo Wang, et al. ∙

research

∙ 04/29/2023

Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning

We propose a novel generalization of constrained Markov decision process...

11 Liangyu Zhang, et al. ∙

research

∙ 02/02/2023

Avoiding Model Estimation in Robust Markov Decision Processes with a Generative Model

Robust Markov Decision Processes (MDPs) are getting more attention for l...

0 Wenhao Yang, et al. ∙

research

∙ 09/12/2022

Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach

In a Markov decision process (MDP), unobservable confounders may exist ...

0 Miao Lu, et al. ∙

research

∙ 05/27/2022

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal

In this work, we consider and analyze the sample complexity of model-fre...

6 Tadashi Kozuno, et al. ∙

research

∙ 05/18/2022

Pluralistic Image Completion with Probabilistic Mixture-of-Experts

Pluralistic image completion focuses on generating both visually realist...

4 Xiaobo Xia, et al. ∙

research

∙ 04/06/2022

Federated Reinforcement Learning with Environment Heterogeneity

We study a Federated Reinforcement Learning (FedRL) problem in which n a...

1 Hao Jin, et al. ∙

research

∙ 01/08/2022

AnomMAN: Detect Anomaly on Multi-view Attributed Networks

Anomaly detection on attributed networks is widely used in web shopping,...

35 Ling-Hao Chen, et al. ∙

research

∙ 05/09/2021

Non-asymptotic Performances of Robust Markov Decision Processes

In this paper, we study the non-asymptotic performance of optimal policy...

4 Wenhao Yang, et al. ∙

research

∙ 10/21/2019

Communication Efficient Decentralized Training with Multiple Local Updates

Communication efficiency plays a significant role in decentralized optim...

0 Xiang Li, et al. ∙

research

∙ 07/04/2019

On the Convergence of FedAvg on Non-IID Data

Federated learning enables a large amount of edge computing devices to l...

0 Xiang Li, et al. ∙

research

∙ 03/02/2019

A Unified Framework for Regularized Reinforcement Learning

We propose and study a general framework for regularized Markov decision...

0 Xiang Li, et al. ∙

