首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Asymptotic behavior of constrained stochastic approximations via the theory of large deviations
Authors:Paul Dupuis  Harold J Kushner
Institution:(1) Lefschetz Center for Dynamical Systems, Division of Applied Mathematics, Brown University, 02912 Providence, RI, USA
Abstract:Summary Let G be a bounded convex set, and Pgr G the projection onto G, and 
$$\{ \xi _j \} $$
a bounded random process. Projected algorithms of the types 
$$X_{n + 1}^\varepsilon   = \Pi _G (X_n^\varepsilon   + \varepsilon b(X_n^\varepsilon  ,\xi _n ))({\text{or }}X_{n + 1}  = \Pi _G (X_n  + a_n b(X_n ,\xi _n ))$$
, where 0<a nrarr0, Sgr a n =infin) occur frequently in applications (among other places) in control and communications theory. The asymptotic convergence properties of {X n epsi } as epsirarr0, epsinrarrinfin, have been well analyzed in the literature. Here, we use large deviations methods to get a more thorough understanding of the global behavior. Let THgr be a stable point of the algorithm in the sense that X n epsi rarrTHgr in distribution as epsirarr0, nepsirarrinfin. For the unconstrained case, rate of convergence results involve showing asymptotic normality of 
$$\{ (X_n^\varepsilon   - {{\Theta )} \mathord{\left/ {\vphantom {{\Theta )} {\sqrt \varepsilon  }}} \right. \kern-\nulldelimiterspace} {\sqrt \varepsilon  }}\} $$
, and use linearizations about THgr. In the constrained case THgr is often on partG, and such methods are inapplicable. But the large deviations method yields an alternative which is often more useful in the applications. The action functionals are derived and their properties (lower semicontinuity, etc.) are obtained. The statistics (mean value, etc.) of the escape times from a neighborhood of THgr are obtained, and the global behavior on the infinite interval is described.Research has been supported in part by the US Army Research Office under Contract #DAAG 29-84-K-0082, and in part by the Office of Naval Research under Contract #N00014-83-K0542Research has been supported in part by the National Science Foundation Grant #ECS 82-11476, and the Air Force Office of Scientific Research under Contract #AF-AFOSR 81-0116
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号