We consider model selection for sequential decision making in stochastic...
Best-of-both-worlds algorithms for online learning which achieve near-op...
Policy optimization methods are popular reinforcement learning algorithm...
Blackwell's celebrated approachability theory provides a general framewo...
We study reinforcement learning in stochastic path (SP) problems. The go...
We study the problem of model selection in bandit scenarios in the prese...
Myopic exploration policies such as epsilon-greedy, softmax, or Gaussian...
To study information processing in the brain, neuroscientists manipulate...
We develop a model selection approach to tackle reinforcement learning w...
We provide improved gap-dependent regret bounds for reinforcement learni...
There have been many recent advances on provably efficient Reinforcement...
We investigate the problem of active learning in the streaming setting i...
We propose a simple model selection approach for algorithms in stochasti...
We study episodic reinforcement learning in Markov decision processes wh...
While maximizing expected return is the goal in most reinforcement learn...
The performance of a reinforcement learning algorithm can vary drastical...
We study the computational tractability of provably sample-efficient (PA...
In the artificial intelligence field, learning often corresponds to chan...
Statistical performance bounds for reinforcement learning (RL) algorithm...
Optimal stopping problems consider the question of deciding when to stop...
We propose a new method to study the internal memory used by reinforceme...
We introduce a framework and early results for massively scalable Gaussi...
Recently, there has been significant progress in understanding reinforce...
Bayesian nonparametric models, such as Gaussian processes, provide a
com...
We propose a computational model for shape, illumination and albedo infe...