Exploration is an essential component of reinforcement learning algorith...
We study the problem of Safe Policy Improvement (SPI) under constraints ...
A major challenge in reinforcement learning is the design of exploration...
Although Reinforcement Learning (RL) algorithms have found tremendous su...
Randomized value functions offer a promising approach towards the challe...
Current reinforcement learning (RL) methods can successfully learn singl...