Adaptive policies for time-varying stochastic systems under discounted criterion

Authors:	Nadine Hilgert, J. Adolfo Minjárez-Sosa

Institution:	(1) Laboratoire de Biométrie, INRA-ENSA.M, 2 place Viala, 34060 Montpellier CEDEX 1, France (hilgert@ensam.inra.fr). The research of this author was performed while she was visiting the Departamento de Matemáticas, CINVESTAV-IPN, México, D.F.; (2) Departamento de Matemáticas, Universidad de Sonora, Rosales s/n, Col. Centro, 83000, Hermosillo, Sonora, México (aminjare@gauss.mat.uson.mx)

Abstract:	We consider a class of time-varying stochastic control systems, with Borel state and action spaces, and possibly unbounded costs. The processes evolve according to a discrete-time equation $x_{n+1} = G_n(x_n, a_n, \xi_n)$, $n = 0, 1, \ldots$, where the $\xi_n$ are i.i.d. $\mathbb{R}^k$-valued random vectors whose common density is unknown, and the $G_n$ are given functions converging, in a restricted way, to some function $G_\infty$ as $n \to \infty$. Assuming observability of $\xi_n$, we construct an adaptive policy which is asymptotically discounted cost optimal for the limiting control system $x_{n+1} = G_\infty(x_n, a_n, \xi_n)$.
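The setting in the abstract can be illustrated with a small simulation. The sketch below is not the authors' construction, which the abstract does not spell out. It only shows the two ingredients the abstract names: dynamics G_n that converge to a limit G_∞, and a density estimate built from the observed disturbances ξ_n. Everything specific here is an assumption made for illustration: the scalar functions `g_n` and `g_limit`, the uniform noise, the placeholder feedback rule, and the histogram estimator.

```python
import random


def g_limit(x, a, xi):
    """Hypothetical limiting dynamics G_inf (assumed for illustration)."""
    return 0.5 * x + a + xi


def g_n(n, x, a, xi):
    """Hypothetical time-varying dynamics G_n; the extra term vanishes as n grows."""
    return (0.5 + 1.0 / (n + 2)) * x + a + xi


def histogram_density(samples, lo, hi, bins):
    """Piecewise-constant density estimate from observed disturbances."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for s in samples:
        if lo <= s < hi:
            # Clamp the index to guard against floating-point rounding at the upper edge.
            counts[min(bins - 1, int((s - lo) / width))] += 1
    total = len(samples)
    return [c / (total * width) for c in counts]


def simulate(steps, seed=0):
    """Run the system and record the disturbances, which are assumed observable."""
    rng = random.Random(seed)
    x = 1.0
    observed = []
    for n in range(steps):
        a = -0.5 * x                 # placeholder feedback rule, not the paper's adaptive policy
        xi = rng.uniform(-1.0, 1.0)  # true density is hidden from the controller
        x = g_n(n, x, a, xi)
        observed.append(xi)
    return x, observed


final_state, observed = simulate(2000)
density = histogram_density(observed, -1.0, 1.0, bins=10)
```

In an adaptive scheme of the kind the abstract describes, an estimate such as `density` would stand in for the unknown density when choosing actions, and would be refined as more disturbances are observed. How the paper actually does this is not stated in the abstract.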

Keywords:	AMS 1991 subject classifications: 93E20, 90C40
This article is indexed in SpringerLink and other databases.