Inferential Induction: Joint Bayesian Estimation of MDPs and Value Functions

02/08/2020
by Christos Dimitrakakis, et al.

Bayesian reinforcement learning (BRL) offers a decision-theoretic solution to the problem of reinforcement learning. However, typical model-based BRL algorithms have focused on maintaining a posterior distribution over either models or value functions and combining this with approximate dynamic programming or tree search. This paper describes a novel backwards induction principle for performing joint Bayesian estimation of models and value functions, from which many new BRL algorithms can be obtained. We demonstrate this idea with algorithms and experiments in discrete state spaces.
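For orientation, the conventional model-based approach that the abstract contrasts against can be sketched roughly as follows: keep a posterior over discrete MDP transition models, sample models from it, and run backward induction on each sample to obtain a spread of values. This is not the paper's inferential induction method. The Dirichlet prior, the known reward table `R`, the finite horizon, and all function names are assumptions made only for this illustration.

```python
import numpy as np


def dirichlet_posterior(counts, prior=1.0):
    """Dirichlet parameters after observing transition counts[s, a, s'].

    Assumption: independent symmetric Dirichlet priors per (s, a) pair.
    """
    return np.asarray(counts, dtype=float) + prior


def sample_mdp(alpha, rng):
    """Draw one transition kernel P[s, a, s'] from the Dirichlet posterior."""
    n_states, n_actions, _ = alpha.shape
    P = np.empty_like(alpha, dtype=float)
    for s in range(n_states):
        for a in range(n_actions):
            P[s, a] = rng.dirichlet(alpha[s, a])
    return P


def backward_induction(P, R, horizon, gamma=1.0):
    """Finite-horizon optimal values V[t, s] for a known model.

    P has shape (S, A, S); R has shape (S, A) and is assumed known.
    """
    n_states = P.shape[0]
    V = np.zeros((horizon + 1, n_states))
    for t in range(horizon - 1, -1, -1):
        # (S, A, S) @ (S,) gives expected next-step value per (s, a).
        Q = R + gamma * (P @ V[t + 1])
        V[t] = Q.max(axis=1)
    return V


def posterior_value_samples(counts, R, horizon, n_samples=100, seed=0):
    """Values at t = 0 under models sampled from the posterior."""
    rng = np.random.default_rng(seed)
    alpha = dirichlet_posterior(counts)
    return np.stack([
        backward_induction(sample_mdp(alpha, rng), R, horizon)[0]
        for _ in range(n_samples)
    ])
```

Calling `posterior_value_samples(counts, R, horizon)` returns an array of shape `(n_samples, S)` whose rows approximate the distribution over values induced by the model posterior alone. Note that this sketch estimates the model first and derives values from it afterwards, which is the separation the abstract says the proposed joint estimation is meant to move beyond.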
