首页 | 本学科首页   官方微博 | 高级检索  
     检索      


(Approximate) iterated successive approximations algorithm for sequential decision processes
Authors:Pelin G Canbolat  Uriel G Rothblum
Institution:1. Faculty of Industrial Engineering and Management, The Technion—Israel Institute of Technology, Haifa, 32000, Israel
Abstract:The paper proves the convergence of (Approximate) Iterated Successive Approximations Algorithm for solving infinite-horizon sequential decision processes satisfying the monotone contraction assumption. At every stage of this algorithm, the value function at hand is used as a terminal reward to determine an (approximately) optimal policy for the one-period problem. This policy is then iterated for a (finite or infinite) number of times and the resulting return function is used as the starting value function for the next stage of the scheme. This method generalizes the standard successive approximations, policy iteration and Denardo’s generalization of the latter.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号