首页 | 本学科首页   官方微博 | 高级检索  
     


Average cost Markov decision processes under the hypothesis of Doeblin
Authors:Masami Kurano
Affiliation:(1) Department of Mathematics, Faculty of Education, Chiba University, 260 Chiba, Japan
Abstract:Average cost Markov decision processes (MDPs) with compact state and action spaces and bounded lower semicontinuous cost functions are considered. Kurano [7] has treated the general case in which several ergodic classes and a transient set are permitted for the Markov process induced by any randomized stationary policy under the hypothesis of Doeblin and showed the existence of a minimum pair of state and policy. This paper considers the same case as that discussed in Kurano [7] and proves some new results which give the existence theorem of an optimal stationary policy under some reasonable conditions.
Keywords:Markov decision process  average cost criterion  Doeblin condition
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号