research
∙
01/26/2023
Collaborative Regret Minimization in Multi-Armed Bandits
In this paper, we study the collaborative learning model, which concerns...
research
∙
08/18/2022
Communication-Efficient Collaborative Best Arm Identification
We investigate top-m arm identification, a basic problem in bandit theor...
research
∙
07/16/2022
Collaborative Best Arm Identification with Limited Communication on Non-IID Data
In this paper, we study the tradeoffs between time-speedup and the numbe...
research
∙
08/15/2021
Batched Thompson Sampling for Multi-Armed Bandits
We study Thompson Sampling algorithms for stochastic multi-armed bandits...
research
∙
12/02/2020
Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit
Motivated by real-world applications such as fast fashion retailing and ...
research
∙
04/20/2020
Collaborative Top Distribution Identifications with Limited Interaction
We consider the following problem in this paper: given a set of n distri...
research
∙
03/13/2019