The Linear Program approach in multi-chain Markov Decision Processes revisited期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

The Linear Program approach in multi-chain Markov Decision Processes revisited

Authors:	Eitan Altman Flos Spieksma

Institution:	(1) Centre Sophia Antipolis, INRIA, 06565 Valbonne Cedex, France;(2) Institute of Mathematics & Computer Science, University of Leiden, P.O. Box 9512, 2300 RA Leiden, The Netherlands

Abstract:	Linear Programming is known to be an important and useful tool for solving Markov Decision Processes (MDP). Its derivation relies on the Dynamic Programming approach, which also serves to solve MDP. However, for Markov Decision Processes with several constraints the only available methods are based on Linear Programs. The aim of this paper is to investigate some aspects of such Linear Programs, related to multi-chain MDPs. We first present a stochastic interpretation of the decision variables that appear in the Linear Programs available in the literature. We then show for the multi-constrained Markov Decision Process that the Linear Program suggested in 9] can be obtained from an equivalent unconstrained Lagrange formulation of the control problem. This shows the connection between the Linear Program approach and the Lagrange approach, that was previously used only for the case of a single constraint 3, 14, 15].

Keywords:	Multi-chain Markov Decision Processes average cost criterion state-action frequencies deviation measure linear programming lagrange formulation
本文献已被 SpringerLink 等数据库收录！