When designing controllers for safety-critical systems, practitioners of...
A key challenge in applying reinforcement learning to safety-critical do...
The Deep Q-Network proposed by Mnih et al. [2015] has become a benchmark...
We examine the problem of learning and planning on high-dimensional doma...
We propose a new algorithm, Mean Actor-Critic (MAC), for discrete-action...