Federated learning is an important framework in modern machine learning ...
This paper revisits the bandit problem in the Bayesian setting. The Baye...
We study the statistical properties of the dynamic trajectory of stochas...
In model-based reinforcement learning, the transition matrix and reward
...
We introduce a class of variational actor-critic algorithms based on a
v...
A data set sampled from a certain population is biased if the subgroups ...
In model-free reinforcement learning, the temporal difference method and...
We give a sharp convergence rate for the asynchronous stochastic gradien...
For model-free reinforcement learning, the main difficulty of stochastic...
We consider the Vlasov-Fokker-Planck equation with random electric field...
We study a chemical kinetic system with uncertainty modeling a gene
regu...
Stochastic gradient descent (SGD) is almost ubiquitously used for traini...