首页 | 本学科首页   官方微博 | 高级检索  
     检索      


A structured pattern matrix algorithm for multichain Markov decision processes
Authors:Tetsuichiro Iki  Masayuki Horiguchi  Masami Kurano
Institution:(1) Faculty of Education and Culture, Miyazaki University, Miyazaki 889-2192, Japan;(2) General Education, Yuge National College of Maritime Technology, Ehime 794-2593, Japan;(3) Faculty of Education, Chiba University, Chiba 263-8522, Japan
Abstract:In this paper, we are concerned with a new algorithm for multichain finite state Markov decision processes which finds an average optimal policy through the decomposition of the state space into some communicating classes and a transient class. For each communicating class, a relatively optimal policy is found, which is used to find an optimal policy by applying the value iteration algorithm. Using a pattern matrix determining the behaviour pattern of the decision process, the decomposition of the state space is effectively done, so that the proposed algorithm simplifies the structured one given by the excellent Leizarowitz’s paper (Math Oper Res 28:553–586, 2003). Also, a numerical example is given to comprehend the algorithm.
Keywords:Multichain Markov decision processes  Structured algorithm  Communicating class  Transient class  Value iteration
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号