Algorithms for various cost criteria in Reinforcement Learning

Guin, Soumyajit

dc.contributor.advisor	Bhatnagar, Shalabh
dc.contributor.author	Guin, Soumyajit
dc.date.accessioned	2025-04-16T05:24:17Z
dc.date.available	2025-04-16T05:24:17Z
dc.date.submitted	2025
dc.identifier.uri	https://etd.iisc.ac.in/handle/2005/6892
dc.description.abstract	In this thesis we will look at various Reinforcement Learning algorithms. We will look at algorithms for various cost criteria or reward objectives namely Finite Horizon, Discounted Cost, Risk-Sensitive Cost. For Finite Horizon and Risk-Sensitive Cost we derive the policy gradient, and for Discounted Cost we propose a new algorithm called Critic-Actor. We analyze and prove the convergence for all the proposed algorithms. We also analyze the empirical performance of our algorithms through numerical experiments.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartofseries	;ET00902
dc.rights	I grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertation	en_US
dc.subject	Reinforcement Learning	en_US
dc.subject	algorithms	en_US
dc.subject	Finite Horizon	en_US
dc.subject	Risk-Sensitive Cost	en_US
dc.subject	Discounted Cost	en_US
dc.subject	Critic-Actor Algorithm	en_US
dc.subject	Convergence	en_US
dc.subject.classification	Research Subject Categories::TECHNOLOGY::Information technology::Computer science	en_US
dc.title	Algorithms for various cost criteria in Reinforcement Learning	en_US
dc.type	Thesis	en_US
dc.degree.name	PhD	en_US
dc.degree.level	Doctoral	en_US
dc.degree.grantor	Indian Institute of Science	en_US
dc.degree.discipline	Engineering	en_US

Files in this item

Name:: Soumyajit Guin-Phd-2025.pdf
Size:: 4.012Mb
Format:: PDF
Description:: Thesis full text

View/Open

This item appears in the following Collection(s)

Computer Science and Automation (CSA) [545]

Show simple item record