research ∙ 04/26/2023
Thompson Sampling Regret Bounds for Contextual Bandits with sub-Gaussian rewards
In this work, we study the performance of the Thompson Sampling algorith...
research ∙ 07/18/2022