Show simple item record

dc.contributor.advisorBhatnagar, Shalabh
dc.contributor.authorGuin, Soumyajit
dc.date.accessioned2025-04-16T05:24:17Z
dc.date.available2025-04-16T05:24:17Z
dc.date.submitted2025
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/6892
dc.description.abstractIn this thesis we will look at various Reinforcement Learning algorithms. We will look at algorithms for various cost criteria or reward objectives namely Finite Horizon, Discounted Cost, Risk-Sensitive Cost. For Finite Horizon and Risk-Sensitive Cost we derive the policy gradient, and for Discounted Cost we propose a new algorithm called Critic-Actor. We analyze and prove the convergence for all the proposed algorithms. We also analyze the empirical performance of our algorithms through numerical experiments.en_US
dc.language.isoen_USen_US
dc.relation.ispartofseries;ET00902
dc.rightsI grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertationen_US
dc.subjectReinforcement Learningen_US
dc.subjectalgorithmsen_US
dc.subjectFinite Horizonen_US
dc.subjectRisk-Sensitive Costen_US
dc.subjectDiscounted Costen_US
dc.subjectCritic-Actor Algorithmen_US
dc.subjectConvergenceen_US
dc.subject.classificationResearch Subject Categories::TECHNOLOGY::Information technology::Computer scienceen_US
dc.titleAlgorithms for various cost criteria in Reinforcement Learningen_US
dc.typeThesisen_US
dc.degree.namePhDen_US
dc.degree.levelDoctoralen_US
dc.degree.grantorIndian Institute of Scienceen_US
dc.degree.disciplineEngineeringen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record