We consider the reinforcement learning (RL) setting, in which the agent ...
We study the Combinatorial Thompson Sampling policy (CTS) for combinator...
We demonstrate that from an algorithm guaranteeing an approximation fact...
We investigate stochastic combinatorial multi-armed bandit with semi-ban...
We consider the problem of active linear regression where a decision mak...
We improve the efficiency of algorithms for stochastic combinatorial sem...
We consider the problem where an agent wants to find a hidden object tha...