This tutorial serves as an introduction to recently developed non-asympt...
We investigate the problems of model estimation and reward-free learning...
We investigate the problem of best policy identification in discounted l...
Controlling antenna tilts in cellular networks is imperative to reach an...
We consider the problem of online learning in Linear Quadratic Control
s...
We study the problem of best-arm identification with fixed confidence in...
We provide a new finite-time analysis of the estimation error of stable
...
This paper establishes problem-specific sample complexity lower bounds f...