Generating samples given a specific label requires estimating conditiona...
Long-term care service for old people is in great demand in most of the ...
We propose a novel contextual bandit algorithm for generalized linear re...
We propose a novel algorithm for linear contextual bandits with O(√(dT l...
Non-stationarity is ubiquitous in human behavior and addressing it in th...
A challenging aspect of the bandit problem is that a stochastic reward i...
The Mixup method (Zhang et al. 2018), which uses linearly interpolated d...
Wasserstein distributionally robust optimization (WDRO) attempts to lear...
Contextual multi-armed bandit algorithms are widely used in sequential d...
Contextual multi-armed bandit (MAB) algorithms have been shown promising...
We consider the problem of learning a binary classifier from only positi...