This article is a primer on concept extrapolation - the ability to take ...
As artificial intelligence becomes more powerful and a ubiquitous presen...
For an artificial intelligence (AI) to be aligned with human values (or ...
Sigmoids (AKA s-curves or logistic curves) are commonly used in a divers...
To reduce the danger of powerful super-intelligent AIs, we might make th...
In some agent designs like inverse reinforcement learning an agent needs...
Partially Observable Markov Decision Processes (POMDPs) are rich environ...
Indifference is a class of methods that are used to control a reward bas...
Inverse reinforcement learning (IRL) attempts to infer human rewards or
...
An Oracle is a design for potentially high power artificial intelligence...
There are many goals for an AI that could become dangerous if the AI bec...
This paper sets out to resolve how agents ought to act in the Sleeping B...