In offline reinforcement learning (offline RL), one of the main challeng...
Double Q-learning is a classical method for reducing overestimation bias...
Episodic memory-based methods can rapidly latch onto past successful str...
Sample efficiency has been one of the major challenges for deep reinforc...
Object-based approaches for learning action-conditioned dynamics have dem...
Transfer learning can greatly speed up reinforcement learning for a new ...