Constrained Markov decision processes with first passage criteria期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

Constrained Markov decision processes with first passage criteria

Authors:	Yonghui Huang Qingda Wei Xianping Guo

Institution:	1. School of Mathematics and Computational Science, Sun Yat-Sen University, Guangzhou, 510275, China

Abstract:	This paper deals with constrained Markov decision processes (MDPs) with first passage criteria. The objective is to maximize the expected reward obtained during a first passage time to some target set, and a constraint is imposed on the associated expected cost over this first passage time. The state space is denumerable, and the rewards/costs are possibly unbounded. In addition, the discount factor is state-action dependent and is allowed to be equal to one. We develop suitable conditions for the existence of a constrained optimal policy, which are generalizations of those for constrained MDPs with the standard discount criteria. Moreover, it is revealed that the constrained optimal policy randomizes between two stationary policies differing in at most one state. Finally, we use a controlled queueing system to illustrate our results, which exhibits some advantage of our optimality conditions.

Keywords:
本文献已被 SpringerLink 等数据库收录！