首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Blackwell optimal policies in a Markov decision process with a Borel state space
Authors:A A Yushkevich
Institution:(1) Department of Mathematics, University of North Carolina at Charlotte, 28223 Charlotte, NC, USA
Abstract:After an introduction into sensitive criteria in Markov decision processes and a discussion of definitions, we prove the existence of stationary Blackwell optimal policies under following main assumptions: (i) the state space is a Borel one; (ii) the action space is countable, the action sets are finite; (iii) the transition function is given by a transition density; (iv) a simultaneous Doeblin-type recurrence condition holds. The proof is based on an aggregation of randomized stationary policies into measures. Topology in the space of those measures is at the same time a weak and a strong one, and this fact yields compactness of the space and continuity of Laurent coefficients of the expected discounted reward. Another important tool is a lexicographical policy improvement. The exposition is mostly self-contained.Supported by the National Science Foundation.
Keywords:Discrete-time Markov decision process  Borel state space  transition densities  simultaneous Doeblin condition  Blackwell optimality
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号