Modern recommender systems lie at the heart of complex ecosystems that c...
Users derive value from a recommender system (RS) only to the extent tha...
While popularity bias is recognized to play a role in recommmender (and ...
We introduce Dynamic Contextual Markov Decision Processes (DCMDPs), a no...
The development of recommender systems that optimize multi-turn interact...
Efficient exploration in multi-armed bandits is a fundamental online lea...
Most recommender systems (RS) research assumes that a user's utility can...
We study a contextual bandit setting where the learning agent has access...
We learn bandit policies that maximize the average reward over bandit
in...
We propose RecSim, a configurable platform for authoring simulation
envi...
Latent-state environments with long horizons, such as those faced by
rec...
The prevalent approach to bandit algorithm design is to have a low-regre...
In many practical uses of reinforcement learning (RL) the set of actions...
Symmetry is the essential element of lifted inference that has recently
...
A recent trend in probabilistic inference emphasizes the codification of...
We propose relational linear programming, a simple framework for combing...