A <em>K</em>-step look-ahead analysis of value iteration algorithms for Markov decision processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

A K-step look-ahead analysis of value iteration algorithms for Markov decision processes

Authors:	Meir Herzberg Uri Yechiali

Affiliation:	1. Telecom Australia Research Laboratories, 770 Blackburn Rd., Clayton, Vic. 3168, Australia;2. Department of Statistics and Operations Research, School of Mathematical Sciences, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel

Abstract:	We introduce and analyze a general look-ahead approach for Value Iteration Algorithms used in solving both discounted and undiscounted Markov decision processes. This approach, based on the value-oriented concept interwoven with multiple adaptive relaxation factors, leads to accelerating procedures which perform better than the separate use of either the concept of value oriented or of relaxation. Evaluation and computational considerations of this method are discussed, practical guidelines for implementation are suggested and the suitability of enhancing the method by incorporating Phase 0, Action Elimination procedures and Parallel Processing is indicated. The method was successfully applied to several real problems. We present some numerical results which support the superiority of the developed approach, particularly for undiscounted cases, over other Value Iteration variants.

Keywords:	Markov processes Value iteration Modified policy iteration Adaptive relaxation factor Look-ahead analysis
本文献已被 ScienceDirect 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏