Blackwell optimal policies in a Markov decision process with a Borel state space |
| |
Authors: | A. A. Yushkevich |
| |
Affiliation: | (1) Department of Mathematics, University of North Carolina at Charlotte, 28223 Charlotte, NC, USA |
| |
Abstract: | After an introduction into sensitive criteria in Markov decision processes and a discussion of definitions, we prove the existence of stationary Blackwell optimal policies under following main assumptions: (i) the state space is a Borel one; (ii) the action space is countable, the action sets are finite; (iii) the transition function is given by a transition density; (iv) a simultaneous Doeblin-type recurrence condition holds. The proof is based on an aggregation of randomized stationary policies into measures. Topology in the space of those measures is at the same time a weak and a strong one, and this fact yields compactness of the space and continuity of Laurent coefficients of the expected discounted reward. Another important tool is a lexicographical policy improvement. The exposition is mostly self-contained.Supported by the National Science Foundation. |
| |
Keywords: | Discrete-time Markov decision process Borel state space transition densities simultaneous Doeblin condition Blackwell optimality |
本文献已被 SpringerLink 等数据库收录! |
|