Reinforcement learning with sparse rewards is still an open challenge.
C...
Actor-critic methods can achieve incredible performance on difficult
rei...
Direct contextual policy search methods learn to improve policy paramete...
This document contains supplementary material for the paper "Multi-objec...