A Concentration Bound for LSPE(λ)

11/04/2021
by Vivek S. Borkar, et al.

The popular LSPE(λ) algorithm for policy evaluation is revisited to derive a concentration bound that gives high-probability performance guarantees from some time on.
