The ability to learn continually is essential in a complex and changing
...
Off-policy learning from multistep returns is crucial for sample-efficie...
Selecting exploratory actions that generate a rich stream of experience ...
In many, if not every realistic sequential decision-making task, the
dec...
In this paper we investigate the properties of representations learned b...
In reinforcement learning, the graph Laplacian has proved to be a valuab...
To achieve the ambitious goals of artificial intelligence, reinforcement...
Research on exploration in reinforcement learning, as applied to Atari 2...
We use functional mirror ascent to propose a general framework (referred...
Reinforcement learning methods trained on few environments rarely learn
...
This paper provides an empirical evaluation of recently developed explor...
Deep reinforcement learning (RL) algorithms have shown an impressive abi...
The problem of exploration in reinforcement learning is well-understood ...
Here we propose using the successor representation (SR) to accelerate
le...
Eigenoptions (EOs) have been recently introduced as a promising idea for...
Options in reinforcement learning allow agents to hierarchically decompo...
Agents of general intelligence deployed in real-world scenarios must ada...
The temporal-difference methods TD(λ) and Sarsa(λ) form a
core part of m...
In Reinforcement Learning (RL), it is common to use optimistic initializ...
AI is gradually receiving more attention as a fundamental feature to inc...