首页 | 本学科首页   官方微博 | 高级检索  
     


Generalized polynomial approximations in Markovian decision processes
Authors:Paul J. Schweitzer  Abraham Seidmann
Affiliation:The Graduate School of Management, The University of Rochester, Rochester, New York 14627 USA;Department of Industrial Engineering, Tel Aviv University, Ramat Aviv, Israel
Abstract:Fitting the value function in a Markovian decision process by a linear superposition of M basis functions reduces the problem dimensionality from the number of states down to M, with good accuracy retained if the value function is a smooth function of its argument, the state vector. This paper provides, for both the discounted and undiscounted cases, three algorithms for computing the coefficients in the linear superposition: linear programming, policy iteration, and least squares.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号