Blackwell optimal policies in a Markov decision process with a Borel state space期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Blackwell optimal policies in a Markov decision process with a Borel state space

Authors:	A. A. Yushkevich

Affiliation:	(1) Department of Mathematics, University of North Carolina at Charlotte, 28223 Charlotte, NC, USA

Abstract:	After an introduction into sensitive criteria in Markov decision processes and a discussion of definitions, we prove the existence of stationary Blackwell optimal policies under following main assumptions: (i) the state space is a Borel one; (ii) the action space is countable, the action sets are finite; (iii) the transition function is given by a transition density; (iv) a simultaneous Doeblin-type recurrence condition holds. The proof is based on an aggregation of randomized stationary policies into measures. Topology in the space of those measures is at the same time a weak and a strong one, and this fact yields compactness of the space and continuity of Laurent coefficients of the expected discounted reward. Another important tool is a lexicographical policy improvement. The exposition is mostly self-contained.Supported by the National Science Foundation.

Keywords:	Discrete-time Markov decision process Borel state space transition densities simultaneous Doeblin condition Blackwell optimality
本文献已被 SpringerLink 等数据库收录！