A structured pattern matrix algorithm for multichain Markov decision processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

A structured pattern matrix algorithm for multichain Markov decision processes

Authors:	Tetsuichiro Iki Masayuki Horiguchi Masami Kurano

Institution:	(1) Faculty of Education and Culture, Miyazaki University, Miyazaki 889-2192, Japan;(2) General Education, Yuge National College of Maritime Technology, Ehime 794-2593, Japan;(3) Faculty of Education, Chiba University, Chiba 263-8522, Japan

Abstract:	In this paper, we are concerned with a new algorithm for multichain finite state Markov decision processes which finds an average optimal policy through the decomposition of the state space into some communicating classes and a transient class. For each communicating class, a relatively optimal policy is found, which is used to find an optimal policy by applying the value iteration algorithm. Using a pattern matrix determining the behaviour pattern of the decision process, the decomposition of the state space is effectively done, so that the proposed algorithm simplifies the structured one given by the excellent Leizarowitz’s paper (Math Oper Res 28:553–586, 2003). Also, a numerical example is given to comprehend the algorithm.

Keywords:	Multichain Markov decision processes Structured algorithm Communicating class Transient class Value iteration
本文献已被 SpringerLink 等数据库收录！