DeepAI

AI Chat AI Image Generator AI Video AI Music Generator

Exploring Variational Deep Q Networks

08/04/2020

∙

by A. H. Bell-Thomas, et al.

∙

∙

This study provides both analysis and a refined, research-ready implementation of Tang and Kucukelbir's Variational Deep Q Network, a novel approach to maximising the efficiency of exploration in complex learning environments using Variational Bayesian Inference. Alongside reference implementations of both Traditional and Double Deep Q Networks, a small novel contribution is presented - the Double Variational Deep Q Network, which incorporates improvements to increase the stability and robustness of inference-based learning. Finally, an evaluation and discussion of the effectiveness of these approaches is discussed in the wider context of Bayesian Deep Learning.

page 8

page 9

research

∙ 02/13/2018

Efficient Exploration through Bayesian Deep Q-Networks

We propose Bayesian Deep Q-Network (BDQN), a practical Thompson sampling...

0 Kamyar Azizzadenesheli, et al. ∙

research

∙ 10/18/2017

Variational Inference based on Robust Divergences

Robustness to outliers is a central issue in real-world machine learning...

1 Futoshi Futami, et al. ∙

research

∙ 09/05/2021

An Exploration of Deep Learning Methods in Hungry Geese

Hungry Geese is a n-player variation of the popular game snake. This pap...

0 Nikzad Khani, et al. ∙

research

∙ 04/16/2021

Uncertainty Surrogates for Deep Learning

In this paper we introduce a novel way of estimating prediction uncertai...

0 Radhakrishna Achanta, et al. ∙

research

∙ 11/30/2017

Variational Deep Q Network

We propose a framework that directly tackles the probability distributio...

1 Yunhao Tang, et al. ∙

research

∙ 10/31/2012

Linear-Nonlinear-Poisson Neuron Networks Perform Bayesian Inference On Boltzmann Machines

One conjecture in both deep learning and classical connectionist viewpoi...

0 Louis Yuanlong Shao, et al. ∙

research

∙ 06/06/2019

Practical Deep Learning with Bayesian Principles

Bayesian methods promise to fix many shortcomings of deep learning, but ...

23 Kazuki Osawa, et al. ∙