A computational successive improvement scheme for adaptive optimal control processes |
| |
Authors: | S. E. Dreyfus H. L. Stalford |
| |
Affiliation: | (1) Department of Industrial Engineering and Operations Research, University of California, Berkeley, California;(2) Radar Division, Naval Research Laboratory, Washington, D.C. |
| |
Abstract: | ![]() To computationally solve an adaptive optimal control problem by means of conventional dynamic programming, a backward recursion must be used with an optimal value and optimal control determined for all conceivable prior information patterns and prior control histories. Consequently, almost all problems are beyond the capability of even large computers.As an alternative, we develop in this paper a computational successive improvement scheme which involves choosing a nominal control policy and then improving it at each iteration. Each improvement involves considerable computation, but much less than the straightforward dynamic programming algorithm. As in any local-improvement procedure, the scheme may converge to something which is inferior to the absolutely optimal control.This paper has been supported by the National Science Foundation under Grant No. GP-25081. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|