首页 | 本学科首页   官方微博 | 高级检索  
     检索      


On the adaptive control of a class of partially observed Markov decision processes
Authors:Shun-Pin Hsu  Ari Arapostathis
Institution:a Department of Electrical Engineering, National Chung Hsing University, 250, Kuo-Kuang Rd., Taichung 402, Taiwan
b Department of Electrical and Computer Engineering, The University of Texas at Austin, 1 University Station C0803, Austin, TX 78712-0240, USA
Abstract:This paper is concerned with the adaptive control problem, over the infinite horizon, for partially observable Markov decision processes whose transition functions are parameterized by an unknown vector. We treat finite models and impose relatively mild assumptions on the transition function. Provided that a sequence of parameter estimates converging in probability to the true parameter value is available, we show that the certainty equivalence adaptive policy is optimal in the long-run average sense.
Keywords:Adaptive control  Markov decision processes  Average-cost optimality
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号