We introduce DeepNash, an autonomous agent capable of learning to play t...
For artificially intelligent learning systems to have widespread
applica...
We introduce a human-compatible reinforcement-learning approach to a
coo...
A common metric in games of imperfect information is exploitability, i.e...
Constructing agents with planning capabilities has long been one of the ...
OpenSpiel is a collection of environments and algorithms for research in...
In this paper, we present exploitability descent, a new algorithm to com...
Stochastic video prediction is usually framed as an extrapolation proble...
We introduce an approach for deep reinforcement learning (RL) that impro...
Sequential models achieve state-of-the-art results in audio, visual and
...
The recently-developed WaveNet architecture is the current state of the ...