research
∙
08/25/2021
A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits
This paper unifies the design and simplifies the analysis of risk-averse...
research
∙
05/14/2021
Thompson Sampling for Gaussian Entropic Risk Bandits
The multi-armed bandit (MAB) problem is a ubiquitous decision-making pro...
research
∙
11/16/2020