Xuefeng Gao

research

∙ 01/30/2023

Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents

The optimized certainty equivalent (OCE) is a family of risk measures th...

Wenhao Xu, et al.

research

∙ 10/03/2022

Square-root regret bounds for continuous-time episodic Markov decision processes

We study reinforcement learning for continuous-time Markov decision proc...

Xuefeng Gao, et al.

research

∙ 05/23/2022

Logarithmic regret bounds for continuous-time average-reward Markov decision processes

We consider reinforcement learning for continuous-time Markov decision p...

Xuefeng Gao, et al.

research

∙ 07/31/2021

Debiasing Samples from Online Learning Using Bootstrap

It has been recently shown in the literature that the sample averages fr...

Ningyuan Chen, et al.

research

∙ 07/08/2021

Sublinear Regret for Learning POMDPs

We study the model-based undiscounted reinforcement learning for partial...

Yi Xiong, et al.

research

∙ 07/01/2020

Decentralized Stochastic Gradient Langevin Dynamics and Hamiltonian Monte Carlo

Stochastic gradient Langevin dynamics (SGLD) and stochastic gradient Ham...

Mert Gurbuzbalaban, et al.

research

∙ 04/06/2020

Non-Convex Stochastic Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics

Stochastic gradient Langevin dynamics (SGLD) is a powerful algorithm for ...

Yuanhan Hu, et al.

research

∙ 01/26/2020

Regime Switching Bandits

We study a multi-armed bandit problem where the rewards exhibit regime-s...

Xiang Zhou, et al.

research

∙ 12/19/2018

Breaking Reversibility Accelerates Langevin Dynamics for Global Non-Convex Optimization

Langevin dynamics (LD) has been proven to be a powerful technique for op...

Xuefeng Gao, et al.

research

∙ 09/12/2018

Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Non-Convex Stochastic Optimization: Non-Asymptotic Performance Bounds and Momentum-Based Acceleration

Stochastic gradient Hamiltonian Monte Carlo (SGHMC) is a variant of stoc...

Xuefeng Gao, et al.
