Sequential user modeling, a critical task in personalized recommender
sy...
As a basic research problem for building effective recommender systems,
...
To obtain higher sample efficiency and superior final performance
simult...
The Arcade Learning Environment (ALE) is proposed as an evaluation platf...
Deep Q Network (DQN) firstly kicked the door of deep reinforcement learn...
Policy-based reinforcement learning methods suffer from the policy colla...
Building on the breakthrough of reinforcement learning, this paper intro...
Constructing agents with planning capabilities has long been one of the ...