Algorithms for various cost criteria in Reinforcement Learning

Guin, Soumyajit

Abstract

This thesis studies Reinforcement Learning algorithms for three cost criteria (reward objectives): Finite Horizon, Discounted Cost, and Risk-Sensitive Cost. For the Finite Horizon and Risk-Sensitive Cost criteria, we derive the policy gradient; for the Discounted Cost criterion, we propose a new algorithm called Critic-Actor. We prove the convergence of all the proposed algorithms and evaluate their empirical performance through numerical experiments.

URI

https://etd.iisc.ac.in/handle/2005/6892

Collections

Computer Science and Automation (CSA) [547]