On Reducing Undesirable Behavior in Deep Reinforcement Learning Models

09/06/2023
by   Ophir Carmel, et al.

Deep reinforcement learning (DRL) has proven extremely useful across a wide variety of application domains. However, even successful DRL-based software can exhibit highly undesirable behavior. This is because DRL training maximizes a reward function, which typically captures general trends but cannot precisely capture, or rule out, specific behaviors of the system. In this paper, we propose a novel framework aimed at drastically reducing the undesirable behavior of DRL-based software while maintaining its excellent performance. In addition, our framework can help provide engineers with a comprehensible characterization of such undesirable behavior. Under the hood, our approach is based on extracting decision tree classifiers from erroneous state-action pairs, and then integrating these trees into the DRL training loop, penalizing the system whenever it makes an error. We provide a proof-of-concept implementation of our approach, and use it to evaluate the technique on three significant case studies. We find that our approach can extend existing frameworks in a straightforward manner, and that it incurs only a slight overhead in training time. Further, it incurs only a very slight hit to performance, and in some cases even improves it, while significantly reducing the frequency of undesirable behavior.
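The abstract's core idea (fit a decision tree on state-action pairs labeled as erroneous, then penalize the agent during training whenever the tree flags its chosen action) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy one-dimensional "speed" state, the two actions, the hand-rolled Gini-based tree learner, the function names, and the fixed `penalty` value. The abstract does not say how the trees are learned or how the penalty is applied.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of boolean labels."""
    n = len(labels)
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())

def fit_tree(rows, labels, depth=0, max_depth=3):
    """Fit a small binary decision tree on (state..., action) feature tuples.

    Returns either a leaf (True/False, meaning "erroneous" or not) or a
    tuple (feature_index, threshold, low_subtree, high_subtree).
    """
    majority = Counter(labels).most_common(1)[0][0]
    if depth == max_depth or len(set(labels)) == 1:
        return majority
    best = None
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            lo = [i for i, r in enumerate(rows) if r[f] <= t]
            hi = [i for i, r in enumerate(rows) if r[f] > t]
            if not lo or not hi:
                continue  # a split must send samples to both sides
            score = (len(lo) * gini([labels[i] for i in lo])
                     + len(hi) * gini([labels[i] for i in hi])) / len(rows)
            if best is None or score < best[0]:
                best = (score, f, t, lo, hi)
    if best is None:
        return majority
    _, f, t, lo, hi = best
    return (f, t,
            fit_tree([rows[i] for i in lo], [labels[i] for i in lo], depth + 1, max_depth),
            fit_tree([rows[i] for i in hi], [labels[i] for i in hi], depth + 1, max_depth))

def predict(tree, row):
    """Walk the tree until a boolean leaf is reached."""
    while isinstance(tree, tuple):
        f, t, lo, hi = tree
        tree = lo if row[f] <= t else hi
    return tree

def shaped_reward(tree, state, action, reward, penalty=1.0):
    """Subtract a penalty when the tree classifies (state, action) as an error."""
    is_error = predict(tree, tuple(state) + (action,))
    return reward - penalty if is_error else reward

# Hypothetical logged data: state = (speed,), action 0 = brake, 1 = accelerate.
# Accelerating at high speed is labeled as the undesirable behavior.
rows = [(0.1, 0), (0.1, 1), (0.5, 0), (0.5, 1),
        (0.9, 0), (0.9, 1), (1.0, 0), (1.0, 1)]
labels = [False, False, False, False, False, True, False, True]

tree = fit_tree(rows, labels)
print(tree)                                                # (0, 0.5, False, (1, 0, False, True))
print(shaped_reward(tree, (0.95,), 1, 1.0, penalty=5.0))   # -4.0
print(shaped_reward(tree, (0.3,), 1, 1.0, penalty=5.0))    # 1.0
```

In a real training loop, `shaped_reward` would wrap the environment's reward before it reaches the learner. The fitted tree also reads as a rule ("speed above 0.5 and action is accelerate"), which is the kind of comprehensible characterization the abstract mentions.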

research
05/04/2022

Using Deep Reinforcement Learning to solve Optimal Power Flow problem with generator failures

Deep Reinforcement Learning (DRL) is being used in many domains. One of ...
research
11/05/2016

Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening

We propose a novel training algorithm for reinforcement learning which c...
research
06/16/2019

MoËT: Interpretable and Verifiable Reinforcement Learning via Mixture of Expert Trees

Deep Reinforcement Learning (DRL) has led to many recent breakthroughs o...
research
03/01/2019

TrojDRL: Trojan Attacks on Deep Reinforcement Learning Agents

Recent work has identified that classification models implemented as neu...
research
02/05/2023

Open Problems and Modern Solutions for Deep Reinforcement Learning

Deep Reinforcement Learning (DRL) has achieved great success in solving ...
research
07/07/2022

DRL-ISP: Multi-Objective Camera ISP with Deep Reinforcement Learning

In this paper, we propose a multi-objective camera ISP framework that ut...
research
07/05/2020

Unsupervised Paraphrasing via Deep Reinforcement Learning

Paraphrasing is expressing the meaning of an input sentence in different...
