Accelerated modified policy iteration algorithms for Markov decision processes
Authors: Oleksandr Shlakhter, Chi-Guhn Lee
Institution: 1. Joseph L. Rotman School of Management, University of Toronto, 105 St. George Street, Toronto, ON, M5S 3E6, Canada
2. Department of Mechanical and Industrial Engineering, University of Toronto, 5 King’s College Road, Toronto, ON, M5S 3G8, Canada
Abstract: We propose a new approach to accelerate the convergence of the modified policy iteration method for Markov decision processes under the total expected discounted reward criterion. In the new policy iteration, an additional operator is applied to the iterate generated by the Markov operator, resulting in a larger improvement in each iteration.
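For orientation, the baseline that the abstract refers to can be sketched as follows. This is a minimal illustration of standard (unaccelerated) modified policy iteration for a discounted MDP, not the authors' accelerated method; the additional operator is not specified in the abstract and is deliberately left out. The function name, the array layout of `P` and `r`, the parameter `m`, the stopping rule, and the two-state example are all assumptions made for illustration.

```python
import numpy as np

def modified_policy_iteration(P, r, gamma, m=5, tol=1e-8, max_iter=10_000):
    """Sketch of standard modified policy iteration (hypothetical helper).

    P: array of shape (A, S, S), P[a, s, t] = probability of s -> t under action a.
    r: array of shape (A, S), r[a, s] = expected one-step reward.
    gamma: discount factor in [0, 1).
    m: number of extra partial-evaluation sweeps per iteration (assumed parameter).
    """
    n_actions, n_states, _ = P.shape
    states = np.arange(n_states)
    v = np.zeros(n_states)
    policy = np.zeros(n_states, dtype=int)
    for _ in range(max_iter):
        # Policy improvement: one application of the optimality operator.
        q = r + gamma * (P @ v)          # shape (A, S)
        policy = q.argmax(axis=0)
        v_new = q.max(axis=0)
        # Partial policy evaluation: m applications of the fixed-policy operator.
        P_pi = P[policy, states]         # shape (S, S)
        r_pi = r[policy, states]         # shape (S,)
        for _ in range(m):
            v_new = r_pi + gamma * (P_pi @ v_new)
        # Simple stopping heuristic on successive iterates (an assumption).
        if np.max(np.abs(v_new - v)) < tol:
            return v_new, policy
        v = v_new
    return v, policy

# Illustrative two-state example: action 0 stays, action 1 switches state;
# only staying in state 1 earns a reward of 1.
P = np.array([[[1.0, 0.0], [0.0, 1.0]],
              [[0.0, 1.0], [1.0, 0.0]]])
r = np.array([[0.0, 1.0],
              [0.0, 0.0]])
v_opt, pi_opt = modified_policy_iteration(P, r, gamma=0.9)
```

With `m = 0` the loop reduces to value iteration, and as `m` grows each iteration approaches a full policy evaluation step; an acceleration of the kind the abstract describes would act on the iterate produced inside this loop.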
This article is indexed in SpringerLink and other databases.