In this paper we consider a stochastic game for modelling the interactio...
We present a novel recommender systems dataset that records the sequenti...
We consider a variant of online binary classification where a learner
se...
We study multi-agent reinforcement learning (MARL) in infinite-horizon
d...
We consider the problem of recommending relevant content to users of an
...
This paper describes a general-purpose extension of max-value entropy se...
This article develops a Bayesian optimization (BO) method which acts dir...
Learning the optimal ordering of content is an important challenge in we...
Deployments of Bayesian Optimization (BO) for functions with stochastic
...
We propose MUMBO, the first high-performing yet computationally efficien...
Thompson Sampling is a well established approach to bandit and reinforce...
Functional linear regression is a widely used approach to model function...
We present FIESTA, a model selection approach that significantly reduces...
This paper considers the posterior contraction of non-parametric Bayesia...
We consider the problem of adaptively placing sensors along an interval ...
Maximising the detection of intrusions is a fundamental and often critic...
This paper examines the long-run behavior of learning with bandit feedba...
K-fold cross validation (CV) is a popular method for estimating the true...
Motivated by problems in search and detection we present a solution to a...
Motivated by the recent applications of game-theoretical learning techni...
It is now well known that decentralised optimisation can be formulated a...
The asymptotic pseudo-trajectory approach to stochastic approximation of...