Generalized polynomial approximations in Markovian decision processes |
| |
Authors: | Paul J. Schweitzer Abraham Seidmann |
| |
Affiliation: | The Graduate School of Management, The University of Rochester, Rochester, New York 14627 USA;Department of Industrial Engineering, Tel Aviv University, Ramat Aviv, Israel |
| |
Abstract: | Fitting the value function in a Markovian decision process by a linear superposition of M basis functions reduces the problem dimensionality from the number of states down to M, with good accuracy retained if the value function is a smooth function of its argument, the state vector. This paper provides, for both the discounted and undiscounted cases, three algorithms for computing the coefficients in the linear superposition: linear programming, policy iteration, and least squares. |
| |
Keywords: | |
本文献已被 ScienceDirect 等数据库收录! |
|