Novel Reinforcement Learning Algorithms and Applications to Hybrid Control Design Problems

Gandhi, Meet

dc.contributor.advisor	Bhatnagar, Shalabh
dc.contributor.author	Gandhi, Meet
dc.date.accessioned	2021-07-04T09:50:30Z
dc.date.available	2021-07-04T09:50:30Z
dc.date.submitted	2021
dc.identifier.uri	https://etd.iisc.ac.in/handle/2005/5183
dc.description.abstract	The thesis is a compilation of two independent works. In the first work, we develop a novel weight-assignment procedure, which leads to several schedule-based algorithms. Learning the value function of a given policy from data samples is an important problem in Reinforcement Learning. TD(λ) is a popular class of algorithms for solving this problem. However, in TD(λ) the weight assigned to the n-step returns decreases exponentially with increasing n. Here, we present a λ-schedule procedure that allows flexibility in assigning weights to the different n-step returns. Based on this procedure, we propose an on-policy algorithm, TD(λ)-schedule, and an off-policy algorithm, TDC(λ)-schedule. We provide proofs of almost sure convergence for both algorithms under a general Markov noise framework and present the results of experiments in which these algorithms show improved performance. In the second work, we design hybrid control policies for hybrid systems whose mathematical models are unknown. Our contributions here are threefold. First, we propose a framework for modelling the hybrid control design problem as a single Markov Decision Process (MDP). This result facilitates the application of off-the-shelf algorithms from the Reinforcement Learning (RL) literature to the design of optimal control policies. Second, we model a set of benchmark examples of the hybrid control design problem in the proposed MDP framework. Third, we adapt the recently proposed Proximal Policy Optimisation (PPO) algorithm to the hybrid action space and apply it to the above set of problems. It is observed that in each case the algorithm converges and finds the optimal policy.	en_US
dc.language.iso	en_US	en_US
dc.rights	I grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now or hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertation.	en_US
dc.subject	Reinforcement Learning	en_US
dc.subject	Stochastic Approximation	en_US
dc.subject	Hybrid Control Design	en_US
dc.subject.classification	Research Subject Categories::TECHNOLOGY::Information technology::Computer science	en_US
dc.title	Novel Reinforcement Learning Algorithms and Applications to Hybrid Control Design Problems	en_US
dc.type	Thesis	en_US
dc.degree.name	MTech (Res)	en_US
dc.degree.level	Masters	en_US
dc.degree.grantor	Indian Institute of Science	en_US
dc.degree.discipline	Engineering	en_US
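
Illustrative note on the abstract: the exponentially decreasing weights it mentions are those of the standard λ-return, which assigns weight (1 - λ)λ^(n-1) to the n-step return. The sketch below only tabulates these standard weights so the decay is visible. It is not taken from the thesis and does not implement the λ-schedule procedure, whose details are not given in this record; the function name `lambda_return_weights` is hypothetical.

```python
def lambda_return_weights(lam, n_max):
    """Standard lambda-return weights (1 - lam) * lam**(n - 1) for n = 1..n_max.

    Hypothetical helper for illustration only; not code from the thesis.
    """
    if not 0.0 <= lam < 1.0:
        raise ValueError("lam must lie in [0, 1)")
    return [(1.0 - lam) * lam ** (n - 1) for n in range(1, n_max + 1)]


# With lam = 0.5, each weight is half of the previous one.
weights = lambda_return_weights(0.5, 4)
print(weights)  # [0.5, 0.25, 0.125, 0.0625]
```

The first n_max weights sum to 1 - λ^n_max; in the standard forward view, the remaining mass covers the longer returns.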

Files in this item

Name: MTech_Research_Thesis_Meet_Gan ...
Size: 1.333Mb
Format: PDF
Description: Thesis full text

This item appears in the following Collection(s)

Computer Science and Automation (CSA) [376]