While policy optimization algorithms have played an important role in re...
A fundamental question in reinforcement learning theory is: suppose the
...
We consider approximate dynamic programming in γ-discounted Markov
decis...
We consider the minimax query complexity of online planning with a gener...
We consider the problem of local planning in fixed-horizon Markov Decisi...
We consider the problem of local planning in fixed-horizon Markov Decisi...
We study algorithms for average-cost reinforcement learning problems wit...
We consider the problem of configuring general-purpose solvers to run
ef...
In spoken dialogue systems, we aim to deploy artificial intelligence to ...