The study of collaborative multi-agent bandits has attracted significant...
Cascading bandits model the task of learning to rank K out of L items
ov...
We consider a multi-agent multi-armed bandit setting in which n honest
a...
For the misspecified linear Markov decision process (MLMDP) model of Jin...
We propose two algorithms for episodic stochastic shortest path problems...
We consider a variant of the traditional multi-armed bandit problem in w...
There has been recent interest in collaborative multi-agent bandits, whe...
We devise and analyze algorithms for the empirical policy evaluation pro...
Given a lazy, reversible Markov chain with n states and transition matri...
In recent years, people have increasingly turned to social networks like...
Many systems, including the Internet, social networks, and the power gri...