Adaptive regularization methods that exploit more than the diagonal entr...
Learning over sparse, high-dimensional data frequently necessitates the ...
Increasing the mini-batch size for stochastic gradient descent offers
si...
Recent model-free reinforcement learning algorithms have proposed
incorp...
Gaussian processes (GPs), or distributions over arbitrary functions in a...