An explicit linear solution for the quadratic dynamic programming problem

Authors:	W. R. S. Sutherland, H. Wolkowicz, V. Zeidan

Affiliation:	(1) Department of Mathematics, Statistics and Computing Sciences, Dalhousie University, Halifax, Nova Scotia, Canada; (2) Department of Combinatorics and Optimization, University of Waterloo, Waterloo, Ontario, Canada; (3) Department of Applied Mathematics, University of Waterloo, Waterloo, Ontario, Canada

Abstract:	For a given vector $x_0$, the sequence $\{x_t\}$ which optimizes the sum of discounted rewards $r(x_t, x_{t+1})$, where $r$ is a quadratic function, is shown to be generated by a linear decision rule $x_{t+1} = S x_t + R$. Moreover, the coefficients $R$, $S$ are given by explicit formulas in terms of the coefficients of the reward function $r$. A unique steady state is shown to exist (except for a degenerate case), and its stability is discussed.
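
The linear decision rule in the abstract can be illustrated with a minimal scalar sketch. The values of `S`, `R`, and `x0` below are hypothetical and are not taken from the paper, whose explicit formulas for the coefficients are not reproduced in this record. The sketch only iterates $x_{t+1} = S x_t + R$ and computes the fixed point $x^* = R / (1 - S)$ of that map. Treating $S = 1$ as the case with no unique fixed point is an assumption of this example and may not coincide with the degenerate case the authors refer to.

```python
# Sketch only: S, R and x0 are hypothetical scalars chosen for illustration,
# not coefficients derived from any reward function in the paper.

def simulate(S, R, x0, steps):
    """Iterate the linear decision rule x_{t+1} = S * x_t + R starting at x0."""
    path = [x0]
    for _ in range(steps):
        path.append(S * path[-1] + R)
    return path


def steady_state(S, R):
    """Solve x = S * x + R for x; return None when S == 1 (no unique solution)."""
    if S == 1:
        return None
    return R / (1 - S)


S, R, x0 = 0.5, 2.0, 10.0           # hypothetical values
path = simulate(S, R, x0, 50)       # trajectory generated by the rule
x_star = steady_state(S, R)         # fixed point of the rule
is_stable = abs(S) < 1              # scalar condition for convergence to x_star

print(path[:4])
print(x_star, is_stable)
```

In this scalar setting the trajectory converges to the fixed point from any starting value exactly when $|S| < 1$. For a matrix $S$ the analogous check would use the spectral radius of $S$.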

Keywords:	Dynamic programming; discrete-time control theory; linear decision rules
This article is indexed in SpringerLink and other databases.