首页 | 本学科首页   官方微博 | 高级检索  
     检索      


The Linear Program approach in multi-chain Markov Decision Processes revisited
Authors:Eitan Altman  Flos Spieksma
Institution:(1) Centre Sophia Antipolis, INRIA, 06565 Valbonne Cedex, France;(2) Institute of Mathematics & Computer Science, University of Leiden, P.O. Box 9512, 2300 RA Leiden, The Netherlands
Abstract:Linear Programming is known to be an important and useful tool for solving Markov Decision Processes (MDP). Its derivation relies on the Dynamic Programming approach, which also serves to solve MDP. However, for Markov Decision Processes with several constraints the only available methods are based on Linear Programs. The aim of this paper is to investigate some aspects of such Linear Programs, related to multi-chain MDPs. We first present a stochastic interpretation of the decision variables that appear in the Linear Programs available in the literature. We then show for the multi-constrained Markov Decision Process that the Linear Program suggested in 9] can be obtained from an equivalent unconstrained Lagrange formulation of the control problem. This shows the connection between the Linear Program approach and the Lagrange approach, that was previously used only for the case of a single constraint 3, 14, 15].
Keywords:Multi-chain Markov Decision Processes  average cost criterion  state-action frequencies  deviation measure  linear programming  lagrange formulation
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号