We study a game played between advertisers in an online ad platform. The...
We consider policy optimization in contextual bandits, where one is give...
We study social learning dynamics where the agents collectively follow a...
We study a game between autobidding algorithms that compete in an online...
We consider contextual bandits with knapsacks (CBwK), a variant of the
c...
Consider a bandit algorithm that recommends actions to self-interested u...
We study the aggregate welfare and individual regret guarantees of dynam...
Cloud computing customers often submit repeating jobs and computation
pi...
The difficulty of recruiting patients is a well-known issue in clinical
...
We observe that many system policies that make threshold decisions invol...
How do you incentivize self-interested agents to explore when they
prefe...
We study the problem of finding personalized reserve prices for unit-dem...
Lipschitz bandits is a prominent version of multi-armed bandits that stu...
We create a computationally tractable algorithm for contextual bandits w...
We propose an algorithm for tabular episodic reinforcement learning with...
Online learning algorithms, widely used to power search and content
opti...
We consider incentivized exploration: a version of multi-armed bandits w...
"Bandits with Knapsacks" () is a general model for multi-armed bandits
u...
We initiate the study of multi-stage episodic reinforcement learning und...
Multi-armed bandits a simple but very powerful framework for algorithms ...
It is common in recommendation systems that users both consume and produ...
We empirically study the interplay between exploration and competition.
...
We empirically study the interplay between exploration and competition.
...
We study contextual bandit learning with an abstract policy class and
co...
We consider Bandits with Knapsacks (henceforth, BwK), a general model fo...
In a social learning setting, there is a set of actions, each of which h...
Online learning algorithms, widely used to power search and content
opti...
Crowdsourcing has been part of the IR toolbox as a cheap and fast mechan...