Mirror descent value iteration (MDVI), an abstraction of Kullback-Leible...
Projection-free online learning has drawn increasing interest due to its...
We propose a novel generalization of constrained Markov decision process...
Robust Markov Decision Processes (MDPs) are getting more attention for
l...
In an Markov decision process (MDP), unobservable confounders may exist ...
In this work, we consider and analyze the sample complexity of model-fre...
Pluralistic image completion focuses on generating both visually realist...
We study a Federated Reinforcement Learning (FedRL) problem in which n
a...
Anomaly detection on attributed networks is widely used in web shopping,...
In this paper, we study the non-asymptotic performance of optimal policy...
Communication efficiency plays a significant role in decentralized
optim...
Federated learning enables a large amount of edge computing devices to l...
We propose and study a general framework for regularized Markov decision...