dc.contributor.advisor | Bhatnagar, Shalabh | |
dc.contributor.author | Jayant, Ashish | |
dc.date.accessioned | 2022-09-13T04:40:10Z | |
dc.date.available | 2022-09-13T04:40:10Z | |
dc.date.submitted | 2022 | |
dc.identifier.uri | https://etd.iisc.ac.in/handle/2005/5849 | |
dc.description.abstract | During the initial iterations of training, most Reinforcement Learning (RL) algorithms make the agent
perform a significant number of random exploratory steps, which limits their practicality in the real
world, since such exploration can lead to potentially dangerous behavior. Safe exploration is therefore
a critical issue in applying RL algorithms to real-world problems. This problem is well studied in the
literature under the Constrained Markov Decision Process (CMDP) framework, in which state transitions
incur single-stage costs in addition to single-stage rewards. The prescribed cost functions map undesirable
behavior at any given time step to a scalar value. The aim is then to find a feasible policy that maximizes
reward returns while keeping cost returns below a prescribed threshold, during training as well as
deployment (a schematic formulation is sketched below this record).
We propose a novel on-policy model-based safe deep RL algorithm that learns the transition dynamics
of the environment online and finds a feasible optimal policy using Lagrangian relaxation-based Proximal
Policy Optimization. This combination of transition-dynamics learning and a safety-promoting RL
algorithm requires 3-4 times fewer environment interactions and incurs fewer cumulative hazard
violations than the model-free approach. We use an ensemble of neural networks with different
initializations to tackle the epistemic and aleatoric uncertainty encountered while learning the
environment model. We present our results on a challenging safe reinforcement learning benchmark,
the OpenAI Safety Gym.
In addition, we perform an attribution analysis of the actions taken by the deep neural network-based
policy at each time step. This analysis helps us to:
1. Identify the feature in the state representation that is primarily responsible for the current
action.
2. Provide empirical evidence of the safety-aware agent's ability to deal with hazards in the
environment, provided that hazard information is present in the state representation.
To perform this analysis, we assume that the state representation carries meaningful information
about hazards and goals. We then compute an attribution vector of the same dimension as the
state using a well-known attribution technique, Integrated Gradients. The resulting attribution
vector gives the importance of each state feature for the current action. | en_US |
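A minimal sketch of the standard formulations the abstract refers to, assuming the usual CMDP and Integrated Gradients conventions; the discount factor \(\gamma\), cost threshold \(d\), Lagrange multiplier \(\lambda\), and baseline state \(x'\) are generic symbols, not values taken from the thesis.
\[
\min_{\lambda \ge 0}\,\max_{\pi}\; \mathcal{L}(\pi,\lambda) \;=\; J_R(\pi) \;-\; \lambda\,\bigl(J_C(\pi) - d\bigr),
\qquad
J_R(\pi) = \mathbb{E}_{\pi}\!\Bigl[\textstyle\sum_{t}\gamma^{t}\,r(s_t,a_t)\Bigr],
\quad
J_C(\pi) = \mathbb{E}_{\pi}\!\Bigl[\textstyle\sum_{t}\gamma^{t}\,c(s_t,a_t)\Bigr]
\]
\[
\mathrm{IG}_i(x) \;=\; (x_i - x'_i)\int_{0}^{1}\frac{\partial F\bigl(x' + \alpha\,(x - x')\bigr)}{\partial x_i}\,d\alpha
\]
Here \(F\) denotes the policy output being attributed and \(x\) the current state; the attribution vector \(\mathrm{IG}(x)\) has the same dimension as the state, as described in the abstract.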
dc.description.sponsorship | NA | en_US |
dc.language.iso | en_US | en_US |
dc.relation.ispartofseries | masters;0019.R1 | |
dc.rights | I grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now or hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part
of this thesis or dissertation | en_US |
dc.subject | Safe RL | en_US |
dc.subject | Planning | en_US |
dc.subject | Reinforcement Learning | en_US |
dc.subject | Safety | en_US |
dc.subject | Explainability | en_US |
dc.subject.classification | Safe Reinforcement Learning | en_US |
dc.subject.classification | Model-based RL | en_US |
dc.subject.classification | Constrained Reinforcement Learning | en_US |
dc.subject.classification | Explainability | en_US |
dc.title | Model-based Safe Deep Reinforcement Learning and Empirical Analysis of Safety via Attribution | en_US |
dc.type | Thesis | en_US |
dc.degree.name | MTech (Res) | en_US |
dc.degree.level | Masters | en_US |
dc.degree.grantor | Indian Institute of Science | en_US |
dc.degree.discipline | Engineering | en_US |