Differential dynamic programming and Newton's method for discrete optimal control problems |
| |
Authors: | D. M. Murray S. J. Yakowitz |
| |
Affiliation: | (1) Institute for Maritime Technology, Simonstown, South Africa;(2) Systems and Industrial Engineering Department, University of Arizona, Tucson, Arizona |
| |
Abstract: | ![]() The purpose of this paper is to draw a detailed comparison between Newton's method, as applied to discrete-time, unconstrained optimal control problems, and the second-order method known as differential dynamic programming (DDP). The main outcomes of the comparison are: (i) DDP does not coincide with Newton's method, but (ii) the methods are close enough that they have the same convergence rate, namely, quadratic.The comparison also reveals some other facts of theoretical and computational interest. For example, the methods differ only in that Newton's method operates on a linear approximation of the state at a certain point at which DDP operates on the exact value. This would suggest that DDP ought to be more accurate, an anticipation borne out in our computational example. Also, the positive definiteness of the Hessian of the objective function is easy to check within the framework of DDP. This enables one to propose a modification of DDP, so that a descent direction is produced at each iteration, regardless of the Hessian.Efforts of the first author were partially supported by the South African Council for Scientific and Industrial Research, and those of the second author by NSF Grants Nos. CME-79-05010 and CEE-81-10778. |
| |
Keywords: | Nonlinear programming optimal control optimal control algorithms nonlinear dynamics quadratic convergence |
本文献已被 SpringerLink 等数据库收录! |
|