Creating visually pleasing stylized ink paintings from 3D models is a
ch...
In this lecture, we present a general perspective on reinforcement learn...
Popular reinforcement learning (RL) algorithms tend to produce a unimoda...
This paper designs a helper-assisted resource allocation strategy in
non...
In this paper, we investigate the optimal probabilistic constellation sh...
Safe reinforcement learning aims to learn the optimal policy while satis...
In this paper, we propose a Thompson Sampling algorithm for unimodal
ban...
The goal of policy-based reinforcement learning (RL) is to search the ma...
Full-sampling (e.g., Q-learning) and pure-expectation (e.g., Expected Sa...
In recent years significant progress has been made in dealing with
chall...
Stochastic mirror descent (SMD) keeps the advantages of simplicity of
im...
Off-policy learning is powerful for reinforcement learning. However, the...
Off-policy reinforcement learning with eligibility traces is challenging...
In this paper, a new meta-heuristic algorithm, called beetle swarm
optim...
In this paper, we focus on policy discrepancy in return-based deep Q-net...
Recently, a new multi-step temporal learning algorithm, called Q(σ),
uni...