Consider a decision-maker that can pick one out of K actions to control ...
Following the novel paradigm developed by Van Roy and coauthors for rein...
The rapid development of Industrial Internet of Things (IIoT) technologi...
The popular LSPE(λ) algorithm for policy evaluation is revisited to deri...
Using a martingale concentration inequality, concentration bounds 'from ...
As the use of Internet of Things (IoT) devices for monitoring purposes b...