research ∙ 11/12/2014
On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence
We provide non-asymptotic bounds for the well-known temporal difference ...
research ∙ 01/08/2014
Actor-Critic Algorithms for Learning Nash Equilibria in N-player General-Sum Games
We consider the problem of finding stationary Nash equilibria (NE) in a ...
research ∙ 06/11/2013