Continuous-Time Controlled Markov Chains with Discounted Rewards期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Continuous-Time Controlled Markov Chains with Discounted Rewards

Authors:	Xianping Guo Onésimo Hernández-Lerma

Affiliation:	(1) The School of Mathematics and Computational Science, Zhongshan University, Guangzhou, 510275, P.R. China;(2) Departamento de Matemáticas, CINVESTAV-IPN, A. Postal 14-740 México D.F. 07000, México

Abstract:	This paper studies denumerable state continuous-time controlled Markov chains with the discounted reward criterion and a Borel action space. The reward and transition rates are unbounded, and the reward rates are allowed to take positive or negative values. First, we present new conditions for a nonhomogeneous Q(t)-process to be regular. Then, using these conditions, we give a new set of mild hypotheses that ensure the existence of -optimal (0) stationary policies. We also present a martingale characterization of an optimal stationary policy. Our results are illustrated with controlled birth and death processes.

Keywords:	continuous-time controlled Markov chains unbounded reward and transition rates discounted criterion optimal stationary policies martingale characterization
本文献已被 SpringerLink 等数据库收录！