We consider the Reinforcement Learning problem of controlling an unknown...
Logistic Bandits have recently undergone careful scrutiny by virtue of t...
Generalized Linear Bandits (GLBs) are powerful extensions to the Linear
...
Logistic Bandits have recently attracted substantial attention, by provi...
In display advertising, a small group of sellers and bidders face each o...
We study the exploration-exploitation dilemma in the linear quadratic
re...
The generalized linear bandit framework has attracted a lot of attention...
Restless bandit problems assume time-varying reward distributions of the...
Second price auctions with reserve price are widely used by the main Int...
With the increasing use of auctions in online advertising, there has bee...
We consider the exploration-exploitation tradeoff in linear quadratic (L...
We derive an alternative proof for the regret of Thompson sampling () in...