We study variance-dependent regret bounds for Markov decision processes
...
We study regret minimization for reinforcement learning (RL) in Latent M...
Over the recent years, reinforcement learning (RL) has shown impressive
...
We study the problem of learning in the stochastic shortest path (SSP)
s...