Defending computer networks from cyber attack requires timely responses ...
High-dimensional policies, such as those represented by neural networks,...
Defending computer networks from cyber attack requires coordinating acti...
Monte Carlo planners can often return sub-optimal actions, even if they ...
A collision avoidance system based on simple digital cameras would help...
Online solvers for partially observable Markov decision processes have d...
Online solvers for partially observable Markov decision processes have d...
Stochastic processes generated by non-stationary distributions are diffi...
Poor sample efficiency is a major limitation of deep reinforcement learn...
Although deep reinforcement learning has advanced significantly over the...
A reliable sense-and-avoid system is critical to enabling safe autonomou...
Deep artificial neural networks (ANNs) can represent a wide range of com...