(Approximate) iterated successive approximations algorithm for sequential decision processes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

(Approximate) iterated successive approximations algorithm for sequential decision processes

Authors:	Pelin G Canbolat Uriel G Rothblum

Institution:	1. Faculty of Industrial Engineering and Management, The Technion—Israel Institute of Technology, Haifa, 32000, Israel

Abstract:	The paper proves the convergence of (Approximate) Iterated Successive Approximations Algorithm for solving infinite-horizon sequential decision processes satisfying the monotone contraction assumption. At every stage of this algorithm, the value function at hand is used as a terminal reward to determine an (approximately) optimal policy for the one-period problem. This policy is then iterated for a (finite or infinite) number of times and the resulting return function is used as the starting value function for the next stage of the scheme. This method generalizes the standard successive approximations, policy iteration and Denardo’s generalization of the latter.

Keywords:
本文献已被 SpringerLink 等数据库收录！