Average cost Markov decision processes under the hypothesis of Doeblin期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Average cost Markov decision processes under the hypothesis of Doeblin

Authors:	Masami Kurano

Affiliation:	(1) Department of Mathematics, Faculty of Education, Chiba University, 260 Chiba, Japan

Abstract:	Average cost Markov decision processes (MDPs) with compact state and action spaces and bounded lower semicontinuous cost functions are considered. Kurano [7] has treated the general case in which several ergodic classes and a transient set are permitted for the Markov process induced by any randomized stationary policy under the hypothesis of Doeblin and showed the existence of a minimum pair of state and policy. This paper considers the same case as that discussed in Kurano [7] and proves some new results which give the existence theorem of an optimal stationary policy under some reasonable conditions.

Keywords:	Markov decision process average cost criterion Doeblin condition
本文献已被 SpringerLink 等数据库收录！