Designing reinforcement learning (RL) agents is typically a difficult pr...
We present Γ-nets, a method for generalizing value function estimation
o...
The Deep Q-Network proposed by Mnih et al. [2015] has become a benchmark...
We propose a new task-specification language for Markov decision process...
For agents and robots to become more useful, they must be able to quickl...