Obstacles on the sidewalk often block the path, limiting passage and
res...
In this paper we revisit some of the fundamental premises for a reinforc...
In this paper we describe an approach to semi-automatically create a lab...
We present a new Q-function operator for temporal difference (TD) learni...
Liquid democracy is a proxy voting method where proxies are delegable. W...
A significant amount of research in recent years has been dedicated towa...