Single and Multi-player Stochastic Dynamic Optimization

Saha, Subhamay

dc.contributor.advisor	Ghosh, Mrinal K
dc.contributor.author	Saha, Subhamay
dc.date.accessioned	2018-04-06T06:52:51Z
dc.date.accessioned	2018-07-31T06:09:19Z
dc.date.available	2018-04-06T06:52:51Z
dc.date.available	2018-07-31T06:09:19Z
dc.date.issued	2018-04-06
dc.date.submitted	2013
dc.identifier.uri	https://etd.iisc.ac.in/handle/2005/3357
dc.identifier.abstract	https://etd.iisc.ac.in/static/etd/abstracts/4224/G25755-Abs.pdf	en_US
dc.description.abstract	In this thesis we investigate single and multi-player stochastic dynamic optimization prob-lems. We consider both discrete and continuous time processes. In the multi-player setup we investigate zero-sum games with both complete and partial information. We study partially observable stochastic games with average cost criterion and the state process be-ing discrete time controlled Markov chain. The idea involved in studying this problem is to replace the original unobservable state variable with a suitable completely observable state variable. We establish the existence of the value of the game and also obtain optimal strategies for both players. We also study a continuous time zero-sum stochastic game with complete observation. In this case the state is a pure jump Markov process. We investigate the nite horizon total cost criterion. We characterise the value function via appropriate Isaacs equations. This also yields optimal Markov strategies for both players. In the single player setup we investigate risk-sensitive control of continuous time Markov chains. We consider both nite and in nite horizon problems. For the nite horizon total cost problem and the in nite horizon discounted cost problem we characterise the value function as the unique solution of appropriate Hamilton Jacobi Bellman equations. We also derive optimal Markov controls in both the cases. For the in nite horizon average cost case we shown the existence of an optimal stationary control. we also give a value iteration scheme for computing the optimal control in the case of nite state and action spaces. Further we introduce a new class of stochastic processes which we call stochastic processes with \age-dependent transition rates". We give a rigorous construction of the process. We prove that under certain assunptions the process is Feller. We also compute the limiting probabilities for our process. We then study the controlled version of the above process. In this case we take the risk-neutral cost criterion. We solve the in nite horizon discounted cost problem and the average cost problem for this process. The crucial step in analysing these problems is to prove that the original control problem is equivalent to an appropriate semi-Markov decision problem. Then the value functions and optimal controls are characterised using this equivalence and the theory of semi-Markov decision processes (SMDP). The analysis of nite horizon problems becomes di erent from that of in nite horizon problems because of the fact that in this case the idea of converting into an equivalent SMDP does not seem to work. So we deal with the nite horizon total cost problem by showing that our problem is equivalent to another appropriately de ned discrete time Markov decision problem. This allows us to characterise the value function and to nd an optimal Markov control.	en_US
dc.language.iso	en_US	en_US
dc.relation.ispartofseries	G25755	en_US
dc.subject	Stochastic Dynamic Optimization	en_US
dc.subject	Stochastic Control Theory	en_US
dc.subject	Stochastic Processes	en_US
dc.subject	Markov Processes	en_US
dc.subject	Continuous-Time Markov Chains	en_US
dc.subject	Stochastic Games	en_US
dc.subject	Semi-Markov Decision Processes	en_US
dc.subject	Markov Processes - Optimal Control	en_US
dc.subject	Continuous Time Stochastic Processes	en_US
dc.subject	Dicrete Time Stochastic Processes	en_US
dc.subject	Continuous Time Markov Chains	en_US
dc.subject	Semi-Markov Decision Processes (SMDP)	en_US
dc.subject	Optimal Markov Control	en_US
dc.subject.classification	Mathematics	en_US
dc.title	Single and Multi-player Stochastic Dynamic Optimization	en_US
dc.type	Thesis	en_US
dc.degree.name	PhD	en_US
dc.degree.level	Doctoral	en_US
dc.degree.discipline	Faculty of Science	en_US